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Power to the people 


A planned database collating medical information for England’s population is a laudable exercise, 


with huge potential for research. But people’s right to opt out has been greatly downplayed. 


bombard all 26.5 million households in the country with an 
innocuous-looking information flyer. In soothing tones the 
leaflet, entitled ‘Better Information Means Better Care, announces 
changes to the way that health officials will handle confidential medi- 
cal records to “improve the quality of care and health services for all”. 

The name of the programme is care.data — not that you would 
know that from the leaflet, which neglects to mention it. From this 
spring, information from medical consultations, for example on diag- 
noses and treatments prescribed, that was once confidential between a 
person and their doctor, will be uploaded to a central database. There 
it will be combined with hospital and other medical records. This will 
become one of the world’s most complete databases on the health care 
of patients. Initially, the data will be used to help health authorities 
to manage NHS resources, but the plan is to eventually open up the 
database to researchers and private companies. The importance for 
research and the medical opportunities afforded by such a unique 
joined-up resource cannot be overestimated. Nature fully supports 
such an endeavour, and this journal spoke out in May 2013 when pro- 
posed changes to European data-protection laws threatened the use- 
fulness of projects such as this latest one (see Nature 497, 287; 2013). 

The key, as always, is consent. The information at stake here is not 
genomic clues to future health risks — already the subject of fierce 
debate — but sensitive data on past and current medical conditions. 
What the government leaflet fails to highlight is the real threat to pri- 
vacy and the possible consequences. Worse, the public-relations exer- 
cise carried out by the government to stress the programme's benefits 
has if anything increased the backlash from privacy campaigners, who 
are now highlighting the risks and urging people not to participate. 
An unfortunate false choice has been established, between scientific 
progress on one side and protection of privacy on the other. 

The government did not initially intend even to allow individuals to 
opt out of having their data centralized in this way, which would have 
flown in the face of the most basic principles of privacy and informed 
consent. The leaflet now states, “You have a choice’, but the govern- 
ment seems to have made it as difficult as possible for people to exer- 
cise that right. They must explicitly contact their local doctor to opt 
out — a requirement that seems a sure way to make certain that most 
wont bother, and so will be opted in by default. UK medical charities, 
including the Wellcome Trust, have launched their own advertising 
campaign in support of care.data, which, although it validly highlights 
many of the research opportunities of such big data, also fails to men- 
tion sufficiently prominently that an opt-out option exists, and indeed 
seems intended to try to reduce the number of people who opt out. 

Maximizing the number of people entering the programme is 
clearly a noble goal. But one cannot help but get the uncomfortable 
impression that, in their enthusiasm to amass these data, the authori- 
ties are using sleight of hand and paying lip service to the principles of 


Te month, the National Health Service (NHS) in England will 


informed consent. Inconvenient as it may be, and even if it has some 
negative effects on the utility of the database, the opt-out option to 
care.data should be prominently displayed, and facilitated. 

The public-relations campaigns also break the first rule of risk com- 
munication, which is to state clearly any potential, even if remote, risks. 
They are far too reassuring, for example, that people’s personal data are 

in safe hands and will be well protected from 


“The removalof abuse. Under the programme, personally 
identifiers is an identifiable data will be stored securely by the 
essentialstepin — Health and Social Care Information Centre 
protecting data, in Leeds, which will review requests for them. 
but itis farfrom Most will be made available only after being 


pseudonymized — a process by which data 
are stripped of information that would oth- 
erwise easily allow identification of the data’s personal provenance. 

This removal of identifiers is an essential step in protecting data, but 
it is far from foolproof, and a determined effort can often re-identify 
pseudonymized data. Furthermore, the Health and Social Care Infor- 
mation Centre’s store of personally identifiable data is not immune 
from hacking or other intrusions. No doubt much thought has gone 
into protecting the data to high standards, but overly reassuring the 
population that its personal data are safe is an invitation to public 
disillusionment in the system down the road. 

The potential gains for health authorities and researchers from 
patient-level data are immense. But both should be insisting on the 
spirit of informed consent — clearer, more-upfront information and 
greater visibility given to people's right to opt out would bea good start. m 


foolproof.” 


Cool heads needed 


As cold weather rages, itis easy to forget the 
difference between weather and climate. 


last month, with harsh winds, downpours and flooding that 

extended into the new year. Across the Atlantic, North Ameri- 
cans found themselves hunkering indoors — or stuck in airports — as 
amass of Arctic air delivered snow and record low temperatures that 
brought services to a halt. As tends to happen whenever the weather 
goes haywire, many wondered whether larger forces were at work; 
British Prime Minister David Cameron said he suspected that global 
warming was partly to blame for his country’s suffering. It is a natural 
assumption, and scientists are actively engaging with the issue. But, 
as always, a little caution is in order. 


Tis United Kingdom had its stormiest December for 50 years 
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On some level, most people understand the difference between 
climate and weather. Climate is the context: the accumulation of tem- 
peratures and precipitation trends that vary depending on location 
and season. Weather is what we experience, and extremes are part of 
the package. This was the message delivered by the UK Met Office, 
which pointed out that stormy conditions are more likely during win- 
ter months. Despite such assessments, however, people continually 
confound weather and climate in the heat — or cold — of the moment. 
Confusion seems unavoidable. 

In the United States, the cold snap extended as far south as Florida, 
forcing thousands of flight cancellations at the height of the holiday 
season. Climate sceptics celebrated, apparently unable or unwilling to 
accept that even a warming planet experiences cold temperatures. A 
small cohort of scientists countered with arguments that global warm- 
ing might in fact be contributing to the string of abnormally cold US 
winters in recent years. The argument is that rapid Arctic warming and 
melting sea ice are destabilizing the fast-flowing air current known 
as the polar jet stream, leading it to the kind of drunken meandering 
that can push Arctic air across North America — and deliver power- 
ful storms to the United Kingdom. If that is true, Cameron may well 
have been right. 

Evidence for the claim that global warming could be disrupting the 
jet stream is disputed. Similar weather events have happened in the 
past, and at least one review of the record suggests that nothing is amiss 
— at least nothing that scientists can pin down as obviously outside 
the normal year-to-year seasonal variations. This does not mean that 
climate change has no role, of course. It just means that we do not yet 
know. In the words of one climate modeller, until the models and the 
observations align, we ought to reserve judgement. As far as the public 
is concerned, there is little to do but dress appropriately, keep an open 
mind and let the science play out. 

The same dynamic has been playing out in recent years with regard 
to the average global temperature, which has plateaued since 1998. At 


first blush, the global-warming ‘hiatus’ runs counter to the warming 
projected by climate models. Here again, climate sceptics have 
pounced, and some climate scientists have rightly begun to explore 
both the climate system and their models to sort out the apparent 
discrepancy. As reported on page 276, researchers are homing in on 
a potential explanation that ties the periodic warming and cooling 
of the eastern equatorial Pacific Ocean to global temperature trends. 

In particular, the cool phase of the Pacific 


“There are many — Decadal Oscillation — which took hold in 
ways to estimate — 1998, coinciding precisely with the hiatus — 
the climate’s seems to drive heat into the ocean, effectively 
likely response cooling the atmosphere. 

to greenhouse Plenty of questions remain. According to 
gases, and the this theory, temperatures will rise anew when 
evidence cuts the eastern Pacific flips into its warm phase 


in the coming years. But how much warming 
should we expect when that happens? Exactly 
how sensitive is Earth’s climate system to increasing atmospheric levels 
of greenhouse gases? Some have argued, in part on the basis of current 
temperature trends, that climate models tend to overestimate warm- 
ing, which would be good news indeed if true. But there are many ways 
to estimate the climate’s likely response to greenhouse gases, and the 
evidence cuts both ways. 

Ultimately, the hiatus has provided an opportunity to better under- 
stand both the climate system and climate models. One lesson is that 
the climate, like day-to-day weather, has its ups and downs. Another is 
that the average global temperature — although a useful indicator — is 
not the only measure of how the climate changes. Scientists are still 
trying to work out what all of this means for the future, but if the past 
is any indication, we may have to live with a fair degree of uncertainty. 
From a policy perspective, little has changed. The range of potential 
impacts projected by climate models warrants much more aggressive 
action than has been initiated so far. m 


both ways.” 


V is for vortex 


Anendangered species helps scientists to learn 
why migrating birds fly in a familiar formation. 


widespread sight in the skies of north Africa that the bird was 

immortalized as an ancient Egyptian hieroglyph. The picture 
symbol denoted the word akh, which means ‘to be resplendent, to 
shine’ Ibis populations are less resplendent today, with just a few hun- 
dred of the wild birds remaining, mainly in Morocco. They can still 
shine, however; a study of 14 northern bald ibises reported this week 
on page 399 offers the first experimental evidence that helps to resolve 
one of the great questions of the natural world: why do migrating birds 
often fly in an elegant V formation? 

The obvious answer is that it saves energy. Just as the mass ranks of 
a peloton ina cycle race make life easier for riders, and as tight forma- 
tions can save aircraft fuel, the signature shape ofa flock of ibises or 
geese is assumed to make flight less of a flap — at least for the bulk of 
the birds that follow the leader. (That is another, less obvious, theory 
for the V shape: that the bird at the front is the best navigator.) 

Some of the most influential research studies do little more than test 
whether the obvious answer to a question is the correct one. When it 
comes to bird flight, the validity of the obvious answer has, until now, 
been concealed by an obvious problem. Namely, that the equipment 
for monitoring the flight of wild birds tends to disappear over the 
horizon along with the bird to which it is attached. (Sensors that are 
able to relay the data tend to be too heavy for birds to carry.) 


r | Vhe northern bald ibis (Geronticus eremita) was once sucha 
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This is where the endangered plight of the northern bald ibis offered 
an opportunity to science. Several captive-breeding programmes exist, 
and a big part of preparing the birds for release is to teach them their 
traditional migration routes. Hand-reared ibises are trained to follow 
conservation experts who are inside a microlight aircraft. So, crucially, 
when these birds set off to fly in formation, they come back. 

Steven Portugal, a researcher at the Royal Veterinary College in Hat- 
field, UK, used the training flights of ibises raised at a zoo in Vienna 
to test the benefits of formation flying. His team fitted the birds with 
lightweight data loggers that could measure both their body position 
and flapping movements. 

The juvenile birds took a while to get into shape; a V formation 
is harder to achieve and maintain than it looks, and it looks pretty 
difficult. (RAF pilots told to fly in a tight V shape during the Second 
World War spent more time watching the position of the plane in front 
than scanning for enemy fighters.) Still, the 14 ibises did manage it for 
long enough for the scientists to accurately record both the distance 
between each bird and the timing of the creatures’ wing flaps. 

The results: when in formation, each bird was able to synchro- 
nize the flapping of its wings so that it could exploit the updraught 
created by the swirling vortex of air from the flapping wingtip of 
the bird in front. When the flock got it right, each following bird 
delayed its wingbeat by just enough to spread a wave of synchrony 
through each arm of the V. When they got it wrong and a following 
bird drifted directly behind the bird in front, the follower registered 
the problem and adjusted the timing of its flaps so that it did not 
become tangled in the powerful downdraught 
of the same vortex. For more, see the associated 
News & Views article on page 295. Or look up 
at the sky, and delight in the rare beauty of an 
obvious answer. = 
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One? This new console is significant, and not just for its games 

and graphics. It is fitted with a camera that can monitor the 
heart rate of people sitting in the same room. The sensor is primarily 
designed for exercise games, allowing players to monitor heart changes 
during physical activity, but, in principle, the same type of system 
could monitor and pass on details of physiological responses to TV 
advertisements, horror movies or even party political broadcasts. 

The Xbox One is the first consumer electronic hardware to perma- 
nently integrate a technology called physiological computing. Those 
of us who work in this field aim to transform the way that people use, 
control and interact with electronic devices in everyday life. But like 
all technology it has a darker side, and chief among the concerns is 
the potential infringement of privacy. The mass 
appeal of the Xbox One suggests the probable 
broad reach of such tools, and now is a good 
time to consider both the benefits and risks of 
the technology — particularly in the wake of the 
first international conference devoted to this 
topic, which took place last week in Lisbon (www. 
phycs.org). 

Most people do not consider the human body 
to be a transmitter, but our nervous systems 
constantly generate data — from the first pulse 
ofa fetal heartbeat to a dying breath. Physiolog- 
ical computing converts these data to control 
inputs for a computing system, using the signals 
as a proxy for the conventional keyboard and 
mouse. Brain-computer interfaces, for exam- 
ple, can already move a cursor on a screen in response to electrical 
fluctuations in the brain. 

The same technology can monitor spontaneous activity from the 
brain and body to infer a computer user’s emotional and cognitive 
state. For instance, moods such as anger or frustration can be detected 
from specific changes in cardiovascular activity and breathing pat- 
terns. And increased concentration on difficult mental tasks produces 
characteristic changes in brain activation that can be picked up in an 
electroencephalogram (EEG). 

Scientists want to use these physiological changes to create technology 
that can respond to the circumstances and adjust conditions to improve 
the quality of the human-computer interaction. A desktop computer 
that can recognize frustration from cardiac data could be programmed 
to offer help or even to play soothing music; sensors in a phone could 
spot stress during a fraught journey in heavy traffic or poor weather 
and automatically divert all calls to voicemail. 


H ow many people have beneath their television a shiny Xbox 


This scenario, in which software adaptsinapro- DNATURE.COM 
active and implicit way to dynamic signals from _ Discuss this article 
the user, represents a radical departure from how __ online at: 

we currently interact with computers. go.nature.com/zgwoiz 


INFORMATION 
GATHERED ON 
A PERSON'S 
PHYSIOLOGY SHOULD 
BE CONSIDERED TO BE 


OWNED 


BY THAT PERSON. 


Physiological data must 
remain confidential 


Electronic devices that track our emotions, heart rate or brain waves should be 
regulated to protect individual privacy, says Stephen Fairclough. 


A good example is digital health, in which wireless devices and 
sensors can record physiological activity to offer a wealth of quantita- 
tive information about lifestyle and well-being. These data can reveal 
the impact of changes in exercise or diet on physiological markers such 
as cardiovascular activity. A colleague who wore a chest band non-stop 
for a year to monitor his heart rate learned, for example, how workload 
affected his sleeping patterns. This type of ambulatory measurement 
—and the cumulative gathering of information — delivers big data at 
the level of the individual. 

Until now, the main barrier to developing this technology has been 
the scarcity of sensors that are both inconspicuous and capable of 
delivering high-quality data. But the field of wearable sensors is evolv- 
ing at an extraordinary pace. The traditional image of the laboratory 
participant festooned in wires is being replaced 
by one in which discreet, ambulatory sensors 
stream data to mobile devices. The ubiquity of 
cameras on smartphones means that, with the 
right app, heart rate can be detected from the 
finger or even remotely from the face. As sen- 
sors improve, so too will their public acceptance. 
Their spread, in turn, will boost the quality of the 
data they can generate and the number of uses 
to which they can be applied. For example, con- 
tinuous monitoring of EEGs using ambulatory 
equipment can reveal patterns of brain activity 
characteristic of epilepsy — useful information 
not only for individuals but also for the health- 
insurance industry. And these advances prompt 
questions such as: who owns the data? Who 
should be allowed to gather and store this information? 

As a researcher, I would never monitor the physiology of a person 
in the lab or field without consent. But privacy concerns are real, and 
I think that most people would be more comfortable with this type 
of technology if protection or regulation were in place sooner rather 
than later. Advances in genomics and gene sequencing have raised 
legitimate concerns about the ability of third parties to covertly obtain 
and screen someone’s DNA — taken perhaps from a used coffee cup to 
test for paternity when it is disputed, for instance. (UK law demands 
consent for such tests.) Similarly, the field of physiological computing 
needs to decide on rules and guidelines for researchers and others. 

We are at the start of this debate, but there is one key point that 
should underpin all future discussions. Information gathered on a 
person’s physiology should be considered to be owned by that person. 
The default position must be that these data should be confidential in 
the same way as medical records, for that is what they are. m 


Stephen Fairclough is professor of psychophysiology at Liverpool 
John Moores University, UK, and blogs at physiologicalcomputing.net. 
e-mail: s.fairclough@ljmu.ac.uk 
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Selections from the 
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RESEARCH HIGHLIGHTS 


Warm carbon coat 
reduces friction 


A coating material made of 
carbon reduces friction not just 
by providing a slippery surface, 
but also by keeping the points 
of contact warm. 

Marcus Bjorling of Lulea 
University of Technology in 
Sweden and his team coated 
steel balls with ‘diamond-like 
carbon’ — a material in which 
the carbon atoms have a 
bonding pattern similar to that 
of diamond. They rolled the 
balls against a metal disk with 
an oil lubricant in between, and 
showed that the carbon coating 
acts as an insulator, lowering 
the viscosity of the lubricant 
and thus reducing the friction 
between the ball and the disk. 

The findings could 
encourage the development of 
lubricant coatings made from 
insulating materials. 

Tribol. Lett. http://doi.org/qtm 
(2014) 


Starfish eyes 
see the light 


Starfish can use eyes at the tips 
of their arms to find their way 
home. 

Most sea-star species have 
compound eyes on their arms, 
but there has been no direct 
evidence that the eyes enable 
vision. In field experiments, 
Anders Garm at the University 
of Copenhagen and Dan-Eric 
Nilsson at Lund University in 


Landslide triggered earthquakes 


Large earthquakes often cause landslides, but in 
an unusual reversal, a massive landslide in a US 
copper mine in April 2013 might have resulted in 
a series of small earthquakes. 

Kristine Pankow and her colleagues at the 
University of Utah in Salt Lake City describe 
two sequential rock avalanches at the Bingham 
Canyon mine (pictured) near Salt Lake City. 
Together, these events comprise probably 


Sweden displaced blue starfish 
(Linckia laevigata) from the 
coral reefs they inhabit off the 
coast of Japan. The starfish 
were able to navigate their 

way back to the reef from 
distances of up to two metres, 
an ability that was lost when the 
researchers surgically removed 
the animals’ eyes. 

The distribution and shape 
of the eyes (pictured) and the 
arrangement of their light- 
sensing cells suggest that the 
starfish can recognize the 
reef from only relatively short 
distances. This might help the 
creatures to stay close to home, 
the researchers say. 

Proc. R. Soc. B 281, 20133011 
(2014) 
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the largest non-volcanic landslide to have 
occurred in North America in modern times. 
The proximity of the landslide to a seismic 


monitoring network produced extensive data, 


RNA retrieved 
from intact tissue 


A technique can snatch RNA 
out of a single cell in live, intact 
tissue, revealing the genes 
being expressed by that cell. 

Other methods of single-cell 
RNA retrieval tend to disrupt 
the surrounding tissue before 
the RNA is captured. James 
Eberwine of the University of 
Pennsylvania in Philadelphia 
and his colleagues created a 
molecule called a TIVA tag that 
penetrates cells. 

When hit with a beam of 
light, the tag binds to RNA that 
is being made from its DNA 


which showed that the avalanches had an 
estimated magnitude of about 2.5. In the days 
following the landslide, the sensors detected 16 
more seismic events. 

GSA Today 24, 4-9 (2014) 


template. By adding the tag 

to human and mouse brain 
tissue and shining a laser on 
individual cells, the researchers 
could activate the tag, retrieve 
the RNA and sequence it. 

The approach could reveal 
how a cell’s natural micro- 
environment affects its activity. 
Nature Meth. http://dx.doi. 
org/10.1038/nmeth.2804 
(2014) 


Trilobites ventured 
beyond the ocean 


Contrary to their reputation 
as denizens of the open ocean, 
the extinct creatures known 


RAVELL CALL/DESERET NEWS/AP 
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as trilobites may have crawled 
up on to tidal flats. 

Fossils found by Gabriela 
Mangano at Canada’s 
University of Saskatchewan 
and her colleagues in rock 
deposits from ancient tidal 
flats reveal that trilobites 
took to the land during the 
Cambrian explosion some 
540 million years ago, when 
the number of animal species 
increased drastically. 

The team discovered the 
fossils along with fossilized 
tracks in rocks from the US 
Appalachian Mountains. The 
rocks also showed signs of 
cracks from periodic drying, 
hinting that they originated 
in a tidal flat zone. 

The finding supports the 
idea that terrestrial creatures 
evolved from marine rather 
than freshwater ancestors. 
Intertidal zones could 
have provided food or safe 
havens for these animals, the 
researchers suggest. 

Geology http://doi.org/qnq 
(2013) 


How exercise 
benefits the body 


A small molecule produced by 
muscles in response to exercise 
boosts metabolism in other 
tissues. 

Robert Gerszten of 
Massachusetts General 
Hospital in Boston and his 
colleagues discovered the 
molecule, BAIBA, when they 
forced muscle cells to express 
the metabolic regulator 
PGC-1a — levels of which 
increase with exercise. BAIBA 
levels increased in exercising 
mice. In animals treated 
with BAIBA, white fat tissue 
showed greater expression 
of genes linked to calorie 
burning, and the mice gained 
less weight and had better 
glucose metabolism than 
untreated mice. 

The researchers also found 
an inverse association in 
humans between BAIBA levels 
and heart-disease risk factors 
— people with more BAIBA in 
their blood also had decreased 
cholesterol levels and less 


insulin resistance, for instance. 
BAIBA could be a target for 
drugs that treat diabetes and 
other metabolic disorders, the 
authors say. 

Cell Metab. 19, 96-108 (2014) 


Past warmth drives 
glacial melting 


The world’s glaciers will 
probably continue to shrink 
over the next decades, 
irrespective of the magnitude 
of future warming. 

Ben Marzeion at the 
University of Innsbruck in 
Austria and his colleagues ran 
a global glacier model using 
various twenty-first-century 
scenarios for greenhouse-gas 
concentration. They found 
only small changes in the loss 
of mass from glaciers under 
greatly different climate- 
change conditions. 

Projected glacier melting 
this century is essentially a 
delayed response to climate 
changes in the twentieth 
century, the authors say. The 
thinning and retreat of glaciers 
from low-lying areas make 
them less sensitive to future 
warmer temperatures, they 
conclude. 

Cryosphere 8, 59-71 (2014) 


MICROBIOLOGY 


Marine bacteria 
shed tiny sacs 


The most abundant 
photosynthetic bacterium 
in the oceans casts off many 
minute pieces of itself every 
day, amounting collectively 
to tonnes of material that 
potentially influences the 
global carbon cycle. 

Many bacterial species 
release membrane-bound 
sacs called vesicles, which 
have not been well studied 
in natural ecosystems. Sallie 
Chisholm, Steven Biller 
and their colleagues at the 
Massachusetts Institute of 
Technology in Cambridge 
discovered vesicles in 
laboratory cultures of the 
microbe Prochlorococcus, 
and in samples from 
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Why penguins do the wave 


> HIGHLY READ 
on lopscience.lop.org 
9 Dec-8 Jan 


Physicists have explained how waves 
of coordinated motion sweep through 
huddles of male emperor penguins 


(Aptenodytes forsteri; pictured) as they 

try to keep warm while incubating eggs in the Antarctic. 

Daniel Zitterbart and Richard Gerum at the University of 
Erlangen-Nuremberg, Germany, and their colleagues analysed 
video recordings of penguin huddles and built a mathematical 
model to study the waves. The authors found that any penguin 
taking a step of two centimetres or more within a densely 
packed huddle can trigger ripples of disturbance as nearby 
penguins readjust to keep close (but not too close) to each other. 

The movements were similar to those seen in traffic jams 
in which waves begin at the front of the queue and travel 
backwards. However, in penguin huddles, waves can move in 
multiple directions from any location. 


New J. Phys. 15, 125022 (2013) 


the Atlantic Ocean. 

Analysis revealed that the 
laboratory vesicles contained 
proteins, DNA and RNA, and 
that each Prochlorococcus 
produced two to five 
vesicles per generation. The 
authors estimated that the 
Prochlorococcus sacs could 
be contributing 10* tonnes 
or more of fixed carbon to 
the ocean carbon cycle each 
day. Vesicles might serve to 
decoy attacking viruses away 
from the bacterium and aid in 
gene transfer. 

Science 343, 183-186 (2014) 


Sea-level swings 
get more extreme 


The seasonal rise and fall in sea 
level along the US Gulf coast 
has grown more pronounced 
since the 1990s compared 

with earlier decades, probably 


because of warmer summers 
and colder winters. 

Thomas Wahl and his 
colleagues at the University of 
South Florida in St Petersburg 
compared sea-level 
measurements collected 
between 1900 and 2011 with 
atmospheric data for the Gulf 
of Mexico coastline. They 
found that typical differences 
in sea level between summer 
and winter have increased 
during the past two decades. 

Higher summer sea levels 
could increase the chances of 
hurricane-related flooding, 
and even slight changes in 
both summer and winter sea 
levels may affect sensitive 
ecosystems, the authors say. 
Geophys. Res. Lett. http://doi. 
org/qtd (2014) 
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Power-plant rules 
The US Environmental 
Protection Agency (EPA) 
published a controversial 
rule on 8 January governing 
greenhouse-gas emissions 
from new power plants. 
Advanced natural-gas power 
plants are poised to meet 

the standards, but the rule 
would effectively require new 
coal-fired plants to capture 
and sequester about 40% of 
their emissions — a feat that 
many industry officials have 
criticized as technologically 
infeasible. The EPA will now 
accept public comments on 
the rule while working ona 
second standard governing 
existing power plants. 


Polio progress 


On 13 January, India marked 
its third consecutive year 
without a case of polio, 
clearing the way for the 
World Health Organization 
to certify the southeast 

Asia region as polio-free. 

The achievement is a major 
milestone for India, where 
high population density and 
poor sanitation had enabled 
the poliomyelitis virus to 
spread. Pakistan, Afghanistan 
and Nigeria remain the only 
countries never to interrupt 
transmission of polio, and the 
virus re-emerged last year in 
war-torn Syria and the Horn 
of Africa. 


Novartis woes 


The Japanese health ministry 
filed a criminal complaint 

on 9 January against Swiss 
pharmaceutical firm Novartis, 
claiming that the Basel-based 
company had exaggerated the 
benefits of Diovan (valsartan), 
used to lower blood pressure. 
Advertisements for the best- 
selling drug relied on studies 
showing that it also reduced 


Super-distant galaxies glimpsed 


Astronomers unveiled pictures of the deepest 
galaxy cluster ever imaged at the annual meeting 
of the American Astronomical Society in 
National Harbor, Maryland, which ended on 

9 January. The images from NASA’s Hubble 
Space Telescope are part of the Frontier Fields 
programme, which harnesses the phenomenon 
of ‘gravitational lensing’ (see Nature 497, 
554-556; 2013). The tremendous gravity of large 


the risk of stroke and heart 
attack. But by early 2013, 
some related papers had been 
retracted over flawed data and 
analysis (see go.nature.com/ 
hdfzbi for more). Novartis’s 
Japan unit has acknowledged 
the complaint on its website. 


Cloud computing 
IBM has invested US$1 billion 
in the IBM Watson 

Group, a business unit to 
commercialize artificial- 
intelligence applications that 
can be accessed by customers 
remotely. Located in New 
York City, the group is based 
around Watson, the computer 
that famously won against 
human competitors in a quiz 
show in 2011 (see go.nature. 
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com/u783dz). Watson has 
since been farmed out to 
analyse large data sets, and is 
being tested at the Memorial 
Sloan-Kettering Cancer 
Center in New York City to 
help physicians decide how to 
diagnose and treat patients. 


Space station stays 
As space-agency leaders 
from around the world 
gathered in Washington DC 
to discuss the future of space 
exploration, the US White 
House approved operations 
aboard the International 
Space Station until at least 
2024, extending the previous 
2020 end date. In a joint 


foreground clusters — in this case, Abell 2744 
(pictured) — distorts space, enhancing the 
visibility of more-distant galaxies. Abell 2744, 
which shows hundreds of galaxies as they looked 
3.5 billion years ago, produced gravitational 
lensing that allowed scientists to see background 
galaxies from more than 12 billion years ago. 
Some of the objects captured are 10-20 times 
fainter than any galaxies previously observed. 


announcement on 8 January, 
Charles Bolden, the head of 
NASA, and President Barack 
Obama's chief science adviser 
John Holdren said that the 
decision will enable the 
continuation of short- and 
long-term research, including 
planned human missions to 
an asteroid and Mars. 


Welcome break-up 
Two ships that had been 
stranded in thick ice off 
Antarctica reported finally 
heading for open waters on 

8 January. The Russian vessel 
Akademik Shokalskiy got 
stuck near Commonwealth 
Bay on 24 December during 
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a research voyage, and the 
Chinese icebreaker Xue Long 
became trapped during a 
rescue attempt (see Nature 
505, 133; 2014). Shifting 
weather conditions allowed 
the ships to break free. See 
pages 270 and 291 for more. 


Industrial blast 


An explosion at a chemical 
plant in Japan on 9 January 
killed 5 and injured 12. 

The blast at Mitsubishi 
Materials in Yokkaichi, about 
300 kilometres west of Tokyo, 
occurred while workers 
cleaned a tank used to cool 
gas during the manufacture of 
silicon. The company is still 
investigating the cause of the 
explosion, but some reports 
suggest that residual chlorine 
or hydrogen in the tank may 
have reacted with air. 


Animal activism 


Anti-terrorism police in 

Italy are investigating the 
targeting of four scientists 

at the University of Milan 
who use animals in their 
research. In the early hours 

of 7 January, activists posted 
flyers and graffiti around 

the neighbourhoods where 
the scientists live, giving 

their names, photographs, 
addresses and telephone 
numbers. The flyers described 
the researchers as torturers 
and murderers, and exhorted 
readers to harass the scientists 
by phone. 


TREND WATCH 


Career barriers for young 
scientists vary widely across 
different regions of the globe, 
according to a survey of 
researchers in 12 countries that 
will be published this month 
by the Global Young Academy 
in Berlin. A large fraction 

of respondents in Germany 
reported problems with job 
insecurity, compared with 
other regions (see chart). More 
researchers in Egypt, Pakistan 
and Tunisia cited political 


instability as a career barrier than 


their peers around the world. 


Engineering leader 
The Royal Academy of 
Engineering in London 
announced the nomination 

of Ann Dowling (pictured) 

as its first female president 

on 9 January. Dowling, who 
heads the department of 
engineering at the University 
of Cambridge, UK, became 
the department's first female 
professor in 1993. Her research 
focuses on achieving efficient, 
low-emission combustion and 
developing low-noise vehicles; 
she has received royal honours 
for her services to mechanical 
engineering and science. 

The academy is scheduled 

to elect Dowling formally in 
September. 


AIDS chief 


On 9 January, US President 
Barack Obama nominated 
physician Deborah Birx to 
coordinate the country’s global 
AIDS efforts and administer the 
US President's Emergency Plan 
for AIDS Relief (PEPFAR). 


SHAKY FUTURES 


Birx currently heads the AIDS 
programme at the US Centers 
for Disease Control and 
Prevention in Atlanta, Georgia. 
The PEPFAR programme, 
which receives about 

US$6 billion per year to 
distribute antiretroviral drugs 
and medical care in countries 
affected by AIDS (see Nature 
457, 254-256; 2009), receiveda 
five-year reauthorization in 
December 2013. 


USGS chief 

US President Barack Obama 
announced on 9 January that 
he had nominated Suzette 
Kimball to lead the US 
Geological Survey. Kimball, 
a former deputy-director 

for the agency, has served as 
acting director since February 
2013, when previous head 
Marcia McNutt resigned 

(see go.nature.com/eyn4uu). 
Kimball has also worked 

as an assistant professor of 
environmental sciences at 
the University of Virginia 

in Charlottesville, and as a 
regional chief scientist for the 
US National Park Service. 


Smithsonian head 


Plant pathologist Eva Pell 
announced on 8 January 

that she will step down as 
undersecretary for science at 
the Smithsonian Institution 
in Washington DC, less than 
four months after Wayne 
Clough announced plans to 
retire as the institution's leader 


Job insecurity causes problems for early-career scientists around 
the world, especially in Germany, according to a recent survey. 
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SEVEN DAYS | THIS WEEK | 


20 JANUARY 

The European Space 
Agency’s Rosetta 
spacecraft comes 

out of hibernation in 
preparation for reaching 
its destination later 
this year: the comet 
67P/Churyumov- 
Gerasimenko. The 
mission will be the 
first to land a probe on 
acomet’s surface. See 
page 269 for more. 
go.nature.com/ivxmmo 


22 JANUARY 

The European Union is 
set to unveil a package 
of long-term climate 
and energy goals and 
proposals. 


(see Nature 501, 467; 2013). 
Since 2010, Pell has headed 
the Smithsonian's science 
museums, its nine research 
centres and the National Zoo. 
Previously, she was senior 
vice-president for research 
and graduate-school dean at 
Pennsylvania State University 
in University Park. 


Pesticide risks 


On 8 January, the US 
Environmental Protection 
Agency announced the award 
of nearly US$500,000 in grants 
for research to reduce the risks 
of pesticides, especially to 
bees. Scientists at Pennsylvania 
State University in University 
Park received funding to study 
alternatives to treating seeds 
with neonicotinoids, a class of 
pesticide linked to declines in 
bee populations (see Nature 
496, 408; 2013). At Louisiana 
State University in Baton 
Rouge, researchers will assess 
the long-term risks to bees 
from chemicals used in large- 
scale mosquito-abatement 
programmes. 


> NATURE.COM 
For daily news updates see: 
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Growing dismay A better definition Whitedwart § e? The mysterious 
over stricken Antarctic draws near for the ampere at plus white dwarfequals § case of the missing 
expedition p.270 last p.273 stellar fireworks p.274 global heat p.276 


In November 2014, the Philae lander will be the first to try to land on the surface of a comet, 67P/Churyumov-Gerasimenko, as shown in this artist’s impression. 


Comet craft ready to wake 


Stakes high as European Space Agency waits for Rosetta orbiter to come out of hibernation. 


BY ELIZABETH GIBNEY 


high tension. They often have just one 
chance to get things right, and experi- 
ments can hinge on the success of equipment 
that may be millions of kilometres away. So 
there will be considerable anxiety on 20 Janu- 
ary at the European Space Agency (ESA) when 
the comet-hunting spacecraft Rosetta is due 
to stir after almost three years of hibernation. 
With Rosetta now some 800 million kilo- 
metres from Earth, and rapidly approaching its 
target — comet 67P/Churyumov-Gerasimenko 
— the first sign that things are going to plan on 
Rosetta will be the activation ofa pre-set alarm. 


pace scientists are used to moments of 


This will trigger a series of automated events 
that should see the craft’s components warmed 
up, its spin corrected with thrusters and an 
antenna pointed at Earth to begin communica- 
tions. There will be an anxious wait. 

“I can tell you, you sweat like hell,” says 
Claudio Sollazzo, operations manager of ESA’s 
2005 Huygens mission to Saturn’s moon Titan, 
who endured about two weeks of radio silence 
after the probe was released from the Cassini 
orbiter. “With every second of delay, imme- 
diately you say, ‘OK — something has gone 
wrong. You can know you've done everything 
right, but it’s always human nature to believe 
something bad has happened.” 

Once Rosetta is awake, the stakes will get 


higher. Later this year, mission controllers 
will attempt to observe the comet up close and 
land the craft’s probe, Philae — the first time a 
comet landing has been attempted. Both objec- 
tives aim to deliver vital data about the forma- 
tion of the Solar System and life on our planet. 

The €1-billion (US$1.4-billion) mission has 
been hurtling through the Solar System since 
2004. Its 7-billion-kilometre journey has seen 
it fly past asteroids, and slingshot around Earth 
and Mars before it was put into hibernation to 
save energy in June 2011. 

Rosetta will arrive at its target in August, 
when it will map the comet’s surface, gravity, 
shape and rotation to find a suitable site to dis- 
patch the metre-cubed Philae lander (see 
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MISSION MILESTONE 


AUGUST 2015 | 
Closest NOVEMBER 2014 


approach : 
to Sun Probe landing 


DECEMBER 2015 — 


Mission end 


> ‘Mission milestones’). “We really know very 
little about this object,” says Andrea Accom- 
azzo, Rosetta’s spacecraft-operations manager. 

ESA plans to release Philae in November, 
before the comet — already spewing gas and 
dust — gets too close to the Sun, when it will 
start to produce more debris. Rosetta, which 
will ordinarily orbit at up to 100 kilometres 
from the comet, will descend to about 4 kilo- 
metres away for the landing, and will drop its 
instrument-laden probe unpowered into a zone 
about one kilometre squared. With just one shot 
at the landing, which will be automated because 
of a 30-minute delay in communications with 
Earth, the operation will be risky. “Even though 
we will have chosen the best possible site, we'll 
still need some luck to land on a safe part; says 
Mark McCaughrean, senior scientific adviser 


AUGUST 2014 


Rendezvous 


The Rosetta spacecraft is gearing up for its 
close encounter with a comet this year. 


Comet Churyumov- 
Gerasimenko is about 
4.5 kilometres long and 
3.4 km high, more than 
four times the height of 
Dubai’s Burj Khalifa, the 
world’s tallest building. 


Burj Khalifa 


at the ESA directorate of science and robotic 
exploration. Rosetta will circle the comet at the 
equivalent of walking pace, albeit as both hurtle 
along at 20 kilometres per second. 

Once Philae has secured itself on the surface 
using a harpoon, its batteries will last a few days; 
after this it must survive on solar power, which 
will be difficult once its panels are covered in 
dust. “We really don’t know how long it will last,’ 
says McCaughrean. Although the emphasis 
will be on the lander’s performance, just getting 
the orbiting spacecraft and its complex instru- 
ments into position around the comet will bea 
big success, he says. “If we put the lander on the 
surface, that’s the cherry on the cake.” 

Comets are primitive objects. Their gas, 
dust and organic molecules have changed lit- 
tle since their creation along with the Solar 


System 4.6 billion years ago. Scientists think 
that they hold strong clues about its origins. 
Comets are also thought to have delivered a 
large fraction of Earth’s water, and possibly 
amino acids, the building blocks of life. 
Assuming all goes to plan, cameras and sen- 
sors on the orbiter will examine the comet in 
detail over the next year, while spectrometers 
analyse the chemistry of its dust. The lander 
will look at the comet’s surface composition and 
structure. Both will assess the ratio of normal to 
heavy water — formed from the hydrogen iso- 
tope deuterium — to see if comet ice matches 
Earth's water signature, and their instruments 
will hunt for the complex organic molecules 
that are needed to assemble primitive life. 
Uwe Meierhenrich, an analytical chemist 
at the University of Nice Sophia Antipolis 
in France, will be anxiously waiting for the 
results. He is co-investigator of Philae’s COSAC 
(Cometary Sampling and Composition) 
experiment, which will analyse materials from 
around 20 centimetres beneath the comet’s 
surface. These might include organic materials 
that do not vaporize and never form part of the 
gas tail that astronomers can study from Earth. 
COSAC will also measure the chirality (or 
‘handedness’) of any detected amino acids, 
something that is impossible through remote 
observation and has never been done before on 
comet samples, says Meierhenrich. On Earth, 
amino acids in proteins are left-handed, so 
finding a predominance of left-handed mol- 
ecules on the comet would add weight to theo- 
ries that such a cosmic traveller seeded life on 
Earth by providing essential basic ingredients. m 


SOURCE: ESA 


ANTARCTIC EXPLORATION 


Researchers question 
rescued polar expedition 


Australian Antarctic Division says it did not approve research strategy of stricken mission. 


BY ALEXANDRA WITZE 


he Australian Antarctic Division (AAD) 

has added its voice to the growing criti- 

cism ofa stricken private polar expedi- 

tion by challenging claims that it approved the 
research element of the trip. 

On page 291 of this issue, Nick Gales, chief 
scientist of the AAD, which is based in King- 
ston, Tasmania, responds to an earlier Nature 
column by expedition head Chris Turney of 
the University of New South Wales (see Nature 
505, 133; 2014). Turney’s Australasian Ant- 
arctic Expedition aimed to retrace the steps of 


explorer Douglas Mawson, who led an outing a 
century ago. But members of Turney’s expedi- 
tion had to be rescued from their ship, the MV 
Akademik Shokalskiy, after it became trapped in 
ice at Christmas, adding fuel to a debate about 
the merits of such privately funded trips. 

Gales challenges Turney’s implicit suggestion 
that the AAD had approved the expedition’s sci- 
ence plan. The AAD did not formally review the 
research strategy, Gales notes, but had issued the 
permits required for Turney’s group to visit the 
region in which it got stuck. “It’s an important 
distinction for us,” says Gales. He adds that the 
expedition’s rescue has delayed several projects 
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in Australia’s national Antarctic programme 
that have been many years in the planning. 
Other polar scientists have criticized the nine- 
point science plan laid out on the expedition’s 
website. The plan “could be interpreted as deliv- 
ering outcomes that I believe are not possible 
from this single voyage’, says Richard Coleman, 
deputy director of the Institute for Marine and 
Antarctic Studies at the University of Tasmania 
in Hobart, Australia. For instance, the plan’s 
first bullet point says that the expedition aims 
to “gain new insights into the circulation of the 
Southern Ocean and its impact on the global 
carbon cycle”. A single trip could provide only 
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Members of the Australasian Antarctic Expedition are rescued by a Chinese helicopter after their ship, the MV Akademik Shokalskiy, became trapped in ice. 


limited glimpses into this massive question, says 
Coleman, who sits on the committee that evalu- 
ates research proposals for the AAD. 

Turney, Gales and others all agree that the 
scientific value of the expedition will be meas- 
ured in the quality of peer-reviewed science it 
produces. Yet the expedition’s fate has sharp- 
ened the long-standing tension between 
government polar-research programmes, which 
typically follow strategic plans devised through 
many rounds of peer review, and private voy- 
ages, which occasionally have science objectives 
along with other goals such as tourism or raising 
environmental awareness. 

Turney says that he regrets any confusion 
over the AAD’s involvement in the expedi- 
tion. “At no stage did I intend to convey the 
impression that the [expedition] projects had 
been subject to the competitive peer-reviewed 
process required for participants in the formal 
Australian Antarctic programme,’ he says. 

The Russian-registered Shokalskiy became 
trapped on 24 December in thick ice in the 
Commonwealth Bay region. The vessel's cap- 
tain put out a distress call, and Australian, 
Chinese, French and US ships interrupted 
their schedules and came to help. 

On 2 January, the 52 scientists, students, 
educators and journalists aboard the Shokal- 
skiy evacuated to the Australian vessel Aurora 
Australis aboard a helicopter provided by the 
Chinese icebreaker Xue Long, which had also 


> 


MORE 
ONLINE 


VIDEO OF THE WEEK 


become stuck in ice while attempting to reach 
the stricken Shokalskiy. The complex maritime 
emergency ended a few days later when the 
Xue Long and the Shokalskiy both managed to 
extricate themselves. The Xue Long continued 
on its way to scout the Ross Sea region for the 
site of a future Antarctic base, China’ fifth. 

The Australis travelled to Australia’s Casey 
Antarctic base, where it had been working 
before the rescue, with its new passengers. The 
diversion has put it behind schedule, delaying 
the resupply of several 


of Australia’s polar- “Wearenow 
research stations. A faced with 
Casey-based project _ the difficult 

to study oceanacidifi- situation of how 
cation did not receive fg salvage our 
the divingequipment gyn project.” 


it needs to scout sites 
for a carbon dioxide enrichment experiment 
to be carried out this year, among other delays. 
“The reduction in available field time for 
this season has set us back significantly and 
we will not achieve all of our goals for this 
summer,’ says Donna Roberts, the acidifica- 
tion project’s chief investigator and a senior 
research fellow at the Antarctic Climate and 
Ecosystems Cooperative Research Centre in 
Hobart. “We are now faced with the difficult 
situation of how to salvage our own project.” 
Like other governments involved in Ant- 
arctic research, Australia has a long-term 


strategic science plan built around testable 
hypotheses to answer key questions, says 
Mahlon Kennicutt, a Texas-based ocean- 
ographer and former president of the Scientific 
Committee for Antarctic Research. “Those 
that have invested great time and energy to 
pass the high bar of national funding see their 
programmes being jeopardized by those that 
might be perceived to have circumvented the 
system,’ he says. 

But expedition participant Janet Wilmshurst, 
a palaeoecologist at Landcare Research in Lin- 
coln, New Zealand, argues that the expedition 
has brought back important science. Before 
reaching Antarctica, it explored the subantarctic 
islands of New Zealand. Wilmshurst led a team 
that gathered peat cores, leaf and soil litter and 
other samples to study environmental change. 

“It was incredibly productive for us,” she 
says. For instance, three new peat cores from 
the Snares Islands will be the first from the 
island group to be analysed using modern 
techniques, providing a glimpse into climate 
history at a key location where the northern 
Southern Ocean meets a subtropical front. 

Sorting out the long-term research effects of 
the Shokalskiy rescue may take some time. “My 
biggest concern is that of the reputation for 
Antarctic science,’ says Patrick Quilty, a geol- 
ogist at the University of Tasmania who has 
worked in Antarctica for nearly five decades. 
“There are ramifications.” m 
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ADDGENE 


| NEWS IN FOCUS 


SIGNED, SEALED AND DELIVERED 


In 2013, Addgene, a biorepository in Cambridge, Massachusetts, shipped almost 94,000 plasmids, 
circular packets of DNA used to move genes into cells. 
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Shipped more than 1,000 plasmids for use as 
3 calcium indicators that light up when neurons fire. 


Shipped more than 4,000 plasmids 
used to induce regular cells to 


become pluripotent stem cells. aaa 
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Repositories share 
key research tools 


But some biological resource centres face funding issues. 


BY MONYA BAKER 


student trying to finish her thesis. In 2002, 

Melina Fan, then at Harvard Medical 
School in Boston, Massachusetts, found that 
the genes she needed for her experiments on 
metabolism were in other scientists’ freezers. 
She wrote to more than a dozen labs to request 
certain plasmids — circular packets of DNA 
that scientists use to shuttle genes into cells. 
Only about half responded, and some plasmids 
took months to arrive. When they did, some 
contained the wrong genes. 

“Tt wasnt that the scientists didn’t want to 
share,’ says Fan. “It was just that the logistics 
were difficult.” 

So in January 2004, along with her hus- 
band and her brother, Fan founded Addgene, 
a repository in Cambridge, Massachusetts, 
in which scientists can deposit plasmids for 
free and order them for US$65 apiece. In its 
first year, Addgene shipped only one plas- 
mid, containing a gene that Fan had worked 
on as a graduate student. Last year, the 
non-profit organization sent out more than 
90,000 plasmids, ranging from specialized 


I twas born out of the frustration ofa graduate 


protein-tagging tools to empty DNA rings that 
can be filled with select genes. 

Addgene's growth over its ten years demon- 
strates the extent to which biological resource 
centres (BRCs) have become crucial middle- 
men, enabling the sharing of tools that sci- 
entists say they are willing to distribute but 
often fail to. Where 


cell lines or genetic «We @re 
aaa of model preserving the 
ne ee . ie samples so they 
— TOUS" willbe available 
unreliable, ad hoc . é . 

to scientists in 


networks, BRCs have 
allowed biological 
tools to be procured 
easily online. 

“BRCs are suppliers of public goods that are 
essential to supporting the rate of scientific 
progress,” says Jeffrey Furman, an economist 
at Boston University in Massachusetts, who 
reported in 2011 that depositing cell lines, 
microbe strains and other items in BRCs could 
boost the citation rates of articles associated 
with the materials by 57-135%. And yet some 
are struggling to obtain the funding they need 
to survive. 


the future.” 
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The earliest BRCs have long, venerable 
histories. The American Type Culture Col- 
lection (ATCC) in Manassas, Virginia, estab- 
lished in 1925, holds more than 1 million 
genetic constructs, microbes and cell lines. 
And the Jackson Laboratory in Bar Harbor, 
Maine, established in 1929, maintains a collec- 
tion of 7,000 strains of inbred and genetically 
modified mice. 

Despite Addgene’s relative youth, molecular 
biologists have flocked to its services. Addgene 
shipments reveal hot scientific trends such as 
the use of genetic-editing tool CRISPR. Last 
year, Addgene shipped more than 12,000 plas- 
mids containing CRISPR tools (see ‘Signed, 
sealed and delivered’). 

But although the repositories are valued, 
maintaining them can be challenging, espe- 
cially if they hold expensive materials that 
are studied by few researchers, says Kevin 
McCluskey, curator of the Fungal Genetics 
Stock Center at the University of Missouri-Kan- 
sas City. The collection of more than 25,000 fun- 
gal strains is supported by a US National Science 
Foundation grant that has not been renewed. 

“They want us to become self-sustainable, 
but then you have to raise your fees so much 
that you impact the ability of people to use 
your resources,’ McCluskey says. He notes that 
his collection charges a mere $30 per strain 
to academic labs, whereas those distributed 
by the ATCC can cost ten times as much. (A 
spokesperson for the ATCC says that the cost 
difference is justified by the breadth and qual- 
ity of its collection.) 

Changes in technology can upset the 
apple cart, too. Pieter de Jong, director of the 
BACPAC Resources Center at the Children’s 
Hospital Oakland Research Institute in Cali- 
fornia, built up a diverse collection of artificial 
chromosomes to store and copy large chunks 
of high-quality DNA for sequencing projects 
elsewhere. At one point, the collection was 
used in the Human Genome Project. 

But the rise of faster, cheaper sequencing 
has decreased demand for artificial chromo- 
somes, and user fees no longer cover main- 
tenance, says de Jong. Last year, he discarded 
backup materials and halved the number of 
storage freezers he keeps to cut costs. Next, 
he may need to prune unique material that is 
seldom requested, such as his platypus DNA 
collection. 

But what is seldom requested today can be 
invaluable tomorrow. A heat-stable enzyme 
used in the 1980s to develop the polymerase 
chain reaction, an essential DNA-copying 
technique, came from bacteria deposited in 
the ATCC in the 1960s, with no consideration 
of its practical relevance. 

At Addgene, about one-third of the plas- 
mids are never requested, but the fees from 
the others cover the cost of maintaining them, 
says Fan. “We are preserving the samples 
so they will be available to scientists in the 
future.” m 
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A semiconductor device, seen in a scanning electron micrograph, can measure the flow of single electrons. 


METROLOGY 


Ampere to get 
rational redefinition 


Single-electron flow measured in bid to overhaul SI base unit. 


BY EUGENIE SAMUEL REICH 


a semiconductor chip one at a time — an 

experiment that should at last enable a 
rational definition of the ampere, the unit of 
electrical current. 

At present, an ampere is defined as the 
amount of charge flowing per second through 
two infinitely long wires one metre apart, such 
that the wires attract each other with a force of 
2x 10° newtons per metre of length. That defi- 
nition, adopted in 1948 and based on a thought 
experiment that can at best be approximated in 
the laboratory, is clumsy — almost as much of 
an embarrassment as the definition of the kilo- 
gram, which relies on the fluctuating mass of 
a 125-year-old platinum-and-iridium cylinder 
stored at the International Bureau of Weights 
and Measures (BIPM) in Paris. 

The new approach, described in a paper’ 
posted onto the arXiv server on 19 Decem- 
ber, would redefine the amp on the basis of 
e, a physical constant representing the charge 
of an electron. Metrologists have long sought 
such a rational definition. “It’s an enormously 
challenging thing to try and do and it’s quite 


Pens have tracked electrons crossing 


an important paper,’ says Stephen Giblin, a 
physicist at the National Physical Laboratory 
in Teddington, UK. 

The result will find favour at a meeting of 
the BIPM’s General Conference on Weights 
and Measures in November. There, metrolo- 
gists will discuss a proposal to redefine the 
ampere, the kilogram and two other standard 
(SI) units — the mole and the kelvin — in 
terms of the physical constants e, Planck’s con- 
stant, Avogadro's constant and Boltzmann’s 
constant. 

Two other basic SI units, the metre and the 
second, have already been redefined in terms 
of two constants: the speed of light and the fre- 
quency at which electrons in caesium atoms 
transition between energy levels. 

In the amp experiment’, physicist Hans 
Schumacher of the Federal Institute of Physi- 
cal and Technical Affairs (PTB) in Braunsch- 
weig, Germany, and his colleagues made use 
of a single-electron pump, a device in which 
voltage pulses prompt electrons to quantum- 
mechanically tunnel across barriers one at a 
time. The researchers tracked the paths of indi- 
vidual electrons by detecting changes in the 
electrical charge stored at points between > 
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the barriers. Primitive electron pumps 
have existed since 1990, but this is the first 
time that changes in charge have been 
detected for each hop ofan electron. 

The pump transferred just a few dozen 
electrons per second — slow enough to 
permit precision measurement and thus 
to provide proof of principle for redefin- 
ing the amp. But this is only a first step: the 
set-up would not be practical for calibrat- 
ing current-measuring ammeters, which 
need to run at higher currents. The ulti- 
mate goal is to create a ‘mise en pratique’ 
— a standard-setting experiment that can 
be reproduced in any lab to calibrate meas- 
urements of current precisely — so the race 

is now on to combine 
Schumacher’s vali- 
dation method with 
a higher-current 
pump. 
In 2012, Giblin 
pioneered a semi- 
conductor single-electron pump that 
transferred nearly one billion electrons 
per second (ref. 2), but he could not track 
them one by one. Other types of pump 
include turnstiles, in which electrons tun- 
nel between superconducting wires, and 
tunnel junctions, in which electrons tun- 
nel between aluminium islands separated 
by layers of insulating oxide. But these must 
also be operated at relatively low currents to 
track single electrons. “It’s impossible to say 
which is the winning concept,’ says Jukka 
Pekola, a physicist at Aalto University in 
Espoo, Finland, who reviewed approaches 
to redefining the amp in 2013 (ref. 3). 

Still, the amp is in good shape for the 
November meeting, as is the kelvin. The 
charge of the electron and Boltzmann’s con- 
stant have both been measured precisely, so 
both units are ready to be redefined today, 
says Frangois Piquemal, a physicist at the 
National Laboratory of Metrology and Test- 
ing in Paris. 

But the process could be delayed until 
the 2018 meeting of the BIPM general 
conference. All four units are intertwined, 
so the plan is to redefine them all at once, 
and the kilogram is causing problems. 
There are two rival approaches to its 
redefinition: a watt balance, which would 
balance a test mass against Earth’s gravity 
in terms of electrical power, and a precise 
count of atoms in a sphere of silicon. The 
two approaches give slightly different 
answers. Piquemal says that the differing 
approaches need to be reconciled before 
the units can be redefined. = 


1. Fricke, L. et a/. Preprint at http://arxiv.org/ 
abs/1312.5669 (2013). 

2. Giblin, S. P. et al. Nature Commun. 3, 930 
(2012). 

3. Pekola, J. P. et al. Rev. Mod. Phys. 85, 1421- 
1472 (2013). 
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epler clue to 
supernova puzzle 


Two white dwarfs favoured as precursors of type Ia supernovae. 


BY RON COWEN 


hey are cosmic detonations that briefly 

outshine the light of entire galaxies. 

And they were a crucial tool in the dis- 
covery of dark energy, the force that is accel- 
erating the expansion of the Universe. Yet the 
process that gives rise to type Ia supernovae 
has remained mysterious. 

Now, light from two of these stellar explo- 
sions has been captured in finer temporal 
detail than ever before, and the data are adding 
weight to an emerging view: that the explo- 
sions result from the merger of two white 
dwarfs, the burnt-out, Earth-sized remnants 
of Sun-like stars. The finding erodes a long- 
standing view that type Ia supernovae arise 
from a single white dwarf accruing material 
from an ordinary companion star, either a Sun- 
like star or an elderly, bloated red giant. 

The data have come from an unlikely 
source: NASAs Kepler mission, the space tel- 
escope that searched for alien planets by star- 
ing at some 150,000 stars in nearby reaches of 
the Milky Way. Distant galaxies also lurk in 
the telescope’s field of view, and its ability to 
collect data every 30 minutes, along with its 
sensitivity to tiny changes in brightness, made 
it ideal for recording the rise and fall of light 
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emitted during supernovae. 

Robert Olling, an astronomer at the Uni- 
versity of Maryland in College Park, was lucky 
enough to find two type Ia supernovae after a 
two-year survey of some 400 galaxies in Kepler's 
field. He reported them on 8 January at a meet- 
ing of the American Astronomical Society near 
Washington DC. “As a technical tour de force, 
it's really cool to use Kepler for more than it was 
intended, says Robert Kirshner, an astronomer 
at the Harvard-Smithsonian Center for Astro- 
physics in Cambridge, Massachusetts. 

The exceptional smoothness of the Kepler 
data is helping researchers to distinguish 
between the two competing explosion 
scenarios. Both require that a white dwarf 
takes on material from a companion object 
until the pressure ignites a runaway thermo- 
nuclear explosion. But in the ordinary com- 
panion model, the expanding shell of material 
from the white dwarf would ram into the nor- 
mal star, generating extra heat and light that 
would show up as a bump in the first days of a 
supernova’ brightening. No such bump exists 
in Olling’s data. 

That essentially rules out all red giant com- 
panions, says Olling, because these larger stars 
would result in a sizeable bump. But the data 
could still be compatible with smaller, more 
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A STAR’S LAST RITES 


Light captured from type la supernovae supports the idea that merging white dwarfs lead to the explosions. 


A Sun-like 
star becomes 
a red giant 


It sheds its 
outer layers 


Sun-like companions, says Daniel Kasen, an 
astronomer at the University of California, 
Berkeley, and a collaborator on the survey. Not 
only would these stars cause a much smaller 
bump, but the bump could be missed com- 
pletely depending on the observer’s viewing 
angle, he says. If the supernova lay between 
Kepler and the companion star, for example, the 
associated bump would probably not be seen. 
For years, the idea that type Ia supernovae 
might arise from merging white dwarfs was 
discounted because the final stages of the 
merger were thought to occur slowly, over 
the course of thousands of years. Such a slow 
accretion of material would be more likely to 


It cools and 
contracts into 
an Earth-sized 

white dwarf 


Material is 
added quickly 
from a merging 

companion. 


A runaway 
thermonuclear 
explosion is 
ignited. 


lead to the formation of a neutron star. Then, 
from around 2010, simulations began to sug- 
gest that the mergers could occur in seconds 
or minutes, allowing for the sudden pressure 
change that results in an explosion, says Stan 
Woosley, a theorist at the University of Califor- 
nia, Santa Cruz (see ‘A star’s last rites’). 

Craig Wheeler, a supernova theorist at the 
University of Texas at Austin, says that there are 
still problems with the merger model. For exam- 
ple, he says, simulations of the mergers often 
produce highly asymmetric explosions, yet 
observations so far tend to be more spherical. 
And spectroscopic observations — which split 
light into its component wavelengths — have 
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not found as much radiation from ionized iron 
atoms as merger simulations predict. 

Kepler’s exoplanet-hunting days ended in 
May 2013 after mechanical failures prevented 
it from pointing precisely enough for that task. 
But Olling says that the craft could continue to 
hunt for type Ia supernovae because the bright 
explosions do not require precise pointing. 

It will be crucial to make simultaneous 
observations from ground-based telescopes, 
he notes, because Kepler only records bright- 
ness and cannot split light into spectra. But to 
perform such joint observations, Kepler will 
need to point in the opposite direction; Olling 
hopes that the Kepler team will agree to this. 
NASA is expected to announce its plans for the 
impaired spacecraft this summer. m 


CORRECTION 

This News story ‘Particle-physics papers 
set free’ (Nature 505, 141; 2014) wrongly 
stated that CERN has decreed that all 
articles based on its research must be open 
access. In fact, it is still reviewing its policy. 
And in the World View ‘Academics should 
not remain silent on hacking’ (Nature 504, 
333; 2013), the URL for the non-profit 
organization recruiting experts should have 
been opencryptoaudit.org. 
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THE CASE OF THE 


MISSING HEA 


Sixteen years into the mysterious ‘global-warming hiatus’, 
scientists are piecing together an explanation. 


BY JEFF TOLLEFSON 


the time, with a subtle weakening of the tropical trade winds blowing across the Pacific 

Ocean in late 1997. These winds normally push sun-baked water towards Indonesia. 
When they slackened, the warm water sloshed back towards South America, resulting in a spec- 
tacular example of a phenomenon known as El Nifio. Average global temperatures hit a record 
high in 1998 — and then the warming stalled. 

For several years, scientists wrote off the stall as noise in the climate system: the natural varia- 
tions in the atmosphere, oceans and biosphere that drive warm or cool spells around the globe. 
But the pause has persisted, sparking a minor crisis of confidence in the field. Although there 
have been jumps and dips, average atmospheric temperatures have risen little since 1998, in seem- 
ing defiance of projections of climate models and the ever-increasing emissions of greenhouse 
gases. Climate sceptics have seized on the temperature trends as evidence that global warming 


T he biggest mystery in climate science today may have begun, unbeknownst to anybody at 
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has ground to a halt. Climate scientists, meanwhile, know that heat must 
still be building up somewhere in the climate system, but they have 
struggled to explain where it is going, if not into the atmosphere. Some 
have begun to wonder whether there is something amiss in their models. 

Now, as the global-warming hiatus enters its sixteenth year, scien- 
tists are at last making headway in the case of the missing heat. Some 
have pointed to the Sun, volcanoes and even pollution from China as 
potential culprits, but recent studies suggest that the oceans are key to 
explaining the anomaly. The latest suspect is the El Nifio of 1997-98, 
which pumped prodigious quantities of heat out of the oceans and into 
the atmosphere — perhaps enough to tip the equatorial Pacific into a 
prolonged cold state that has suppressed global temperatures ever since. 

“The 1997 to 98 El Nifio event was a trig- 
ger for the changes in the Pacific, and I think 
that's very probably the beginning of the hia- 
tus,’ says Kevin Trenberth, a climate scientist at 
the National Center for Atmospheric Research 
(NCAR) in Boulder, Colorado. According to 
this theory, the tropical Pacific should snap 
out of its prolonged cold spell in the coming 
years.“Eventually,’ Trenberth says, “it will 
switch back in the other direction” 


STARK CONTRAST 

On a chart of global atmospheric tempera- 
tures, the hiatus stands in stark contrast to the 
rapid warming of the two decades that pre- 
ceded it. Simulations conducted in advance of 
the 2013-14 assessment from the Intergovern- 
mental Panel on Climate Change (IPCC) sug- 
gest that the warming should have continued at 
an average rate of 0.21 °C per decade from 1998 
to 2012. Instead, the observed warming dur- 
ing that period was just 0.04°C per decade, as 
measured by the UK Met Office in Exeter and 
the Climatic Research Unit at the University of 
East Anglia in Norwich, UK. 

The simplest explanation for both the 
hiatus and the discrepancy in the models 
is natural variability. Much like the swings 
between warm and cold in day-to-day weather, 
chaotic climate fluctuations can knock global 
temperatures up or down from year to year and 
decade to decade. Records of past climate show 
some long-lasting global heatwaves and cold 
snaps, and climate models suggest that either of these can occur as the 
world warms under the influence of greenhouse gases. 

But none of the climate simulations carried out for the IPCC pro- 
duced this particular hiatus at this particular time. That has led scep- 
tics — and some scientists — to the controversial conclusion that the 
models might be overestimating the effect of greenhouse gases, and 
that future warming might not be as strong as is feared. Others say that 
this conclusion goes against the long-term temperature trends, as well 
as palaeoclimate data that are used to extend the temperature record far 
into the past. And many researchers caution against evaluating models 
on the basis of a relatively short-term blip in the climate. “If you are 
interested in global climate change, your main focus ought to be on 
timescales of 50 to 100 years,” says Susan Solomon, a climate scientist 
at the Massachusetts Institute of Technology in Cambridge. 

But even those scientists who remain confident in the underlying 
models acknowledge that there is increasing pressure to work out just 
what is happening today. “A few years ago you saw the hiatus, but it 
could be dismissed because it was well within the noise,’ says Gabriel 
Vecchi, a climate scientist at the US National Oceanic and Atmospheric 
Administration’s Geophysical Fluid Dynamics Laboratory in Princeton, 
New Jersey. “Now it’s something to explain” 


WARM PHASE 


-08 -0.6 -04 -0.2 


COLD PHASE 


THE FICKLE OCEAN 


Every few decades, the Pacific flips between warmer 
and cooler conditions in the eastern part of the basin 
and along much of the equator. This cycle is called the 
Pacific Decadal Oscillation (PDO). 


Departures from average ocean temperatures (°C) 
associated with the warm and cool phases of the PDO 
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Researchers have followed various leads in recent years, focusing 
mainly ona trio of factors: the Sun’, atmospheric aerosol particles* and 
the oceans’. The output of energy from the Sun tends to wax and wane 
onan 11-year cycle, but the Sun entered a prolonged lull around the turn 
of the millennium. The natural 11-year cycle is currently approaching its 
peak, but thus far it has been the weakest solar maximum in a century. 
This could help to explain both the hiatus and the discrepancy in the 
model simulations, which include a higher solar output than Earth has 
experienced since 2000. 

An unexpected increase in the number of stratospheric aerosol parti- 
cles could be another factor keeping Earth cooler than predicted. These 
particles reflect sunlight back into space, and scientists suspect that 
small volcanoes — and perhaps even industri- 
alization in China — could have pumped extra 
aerosols into the stratosphere during the past 
16 years, depressing global temperatures. 

Some have argued that these two factors 
could be primary drivers of the hiatus, but 
studies published in the past few years sug- 
gest that their effects are likely to be relatively 
small*”. Trenberth, for example, analysed their 
impacts on the basis of satellite measurements 
of energy entering and exiting the planet, 
and estimated that aerosols and solar activity 
account for just 20% of the hiatus. That leaves 
the bulk of the hiatus to the oceans, which 
serve as giant sponges for heat. And here, the 
spotlight falls on the equatorial Pacific. 


BLOWING HOT AND COLD 

Just before the hiatus took hold, that region had 
turned unusually warm during the El Nifo of 
1997-98, which fuelled extreme weather across 
the planet, from floods in Chile and Califor- 
nia to droughts and wildfires in Mexico and 
Indonesia. But it ended just as quickly as it 
had begun, and by late 1998 cold waters — a 
mark of El Nifios sister effect, La Nifia — had 
returned to the eastern equatorial Pacific with 
a vengeance. More importantly, the entire 
eastern Pacific flipped into a cool state that has 
continued more or less to this day. 

This variation in ocean temperature, known 
as the Pacific Decadal Oscillation (PDO), may 
bea crucial piece of the hiatus puzzle. The cycle 
reverses every 15-30 years, and in its positive phase, the oscillation 
favours El Nifio, which tends to warm the atmosphere (see “The fickle 
ocean’). After a couple of decades of releasing heat from the eastern 
and central Pacific, the region cools and enters the negative phase of 
the PDO. This state tends towards La Nifia, which brings cool waters 
up from the depths along the Equator and tends to cool the planet. 
Researchers identified the PDO pattern in 1997, but have only recently 
begun to understand how it fits in with broader ocean-circulation pat- 
terns and how it may help to explain the hiatus. 

One important finding came in 2011, when a team of researchers at 
NCAR led by Gerald Meehl reported that inserting a PDO pattern into 
global climate models causes decade-scale breaks in global warming’. 
Ocean-temperature data from the recent hiatus reveal why: in a subse- 
quent study, the NCAR researchers showed that more heat moved into 
the deep ocean after 1998, which helped to prevent the atmosphere from 
warming’. In a third paper, the group used computer models to docu- 
ment the flip side of the process: when the PDO switches to its positive 
phase, it heats up the surface ocean and atmosphere, helping to drive 
decades of rapid warming’. 

A key breakthrough came last year from Shang-Ping Xie and 
Yu Kosaka at the Scripps Institution of Oceanography in La Jolla, 
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THE PACIFIC’S GLOBAL REACH 


As researchers have investigated why global temperatures have not risen much since 1998, many have focused on an ocean cycle 
known as the Pacific Decadal Oscillation (PDO). During periods when the PDO index is positive and the eastern Pacific is warm, global 
temperatures have risen quickly. During spells when the PDO index is negative, the warming has stagnated. 
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California. The duo took a different tack, by programming a model 
with actual sea surface temperatures from recent decades in the eastern 
equatorial Pacific, and then seeing what happened to the rest of the 
globe®. Their model not only recreated the hiatus in global temperatures, 
but also reproduced some of the seasonal and regional climate trends 
that have marked the hiatus, including warming in many areas and 
cooler northern winters. 

“Tt was actually a revelation for me when I saw that paper,” says John 
Fyfe, a climate modeller at the Canadian Centre for Climate Modelling 
and Analysis in Victoria. But it did not, he adds, explain everything. 
“What it skirted was the question of what is driving the tropical cooling” 

That was investigated by Trenberth and John Fasullo, also at NCAR, 
who brought in winds and ocean data to explain how the pattern 
emerges’. Their study documents how tropical trade winds associ- 
ated with La Nifia conditions help to drive warm water westward and, 
ultimately, deep into the ocean, while promoting the upwelling of cool 
waters along the eastern equatorial region. In extreme cases, such as the 
La Nifia of 1998, this may be able to push the ocean into a cool phase 
of the PDO. An analysis of historical data buttressed these conclusions, 
showing that the cool phase of the PDO coincided with a few decades 
of cooler temperatures after the Second World War (see “The Pacific’s 
global reacl’), and that the warm phase lined up with the sharp spike 
seen in global temperatures between 1976 and 1998 (ref. 4). 

“T believe the evidence is pretty clear? says Mark Cane, a climatolo- 
gist at Columbia University in New York. “It’s not about aerosols or 
stratospheric water vapour; it’s about having had a decade of cooler 
temperatures in the eastern equatorial Pacific” 


HEATED DEBATE 

Cane was the first to predict the current cooling in the Pacific, although 
the implications werent clear at the time. In 2004, he and his colleagues 
found that a simple regional climate model predicted a warm shift in the 
Pacific that began around 1976, when global temperatures began to rise 
sharply’. Almost as an afterthought, they concluded their paper witha 
simple forecast: “For what it is worth the model predicts that the 1998 
El Nifio ended the post-1976 tropical Pacific warm period.” 

It is an eerily accurate result, but the work remains hotly contested, 
in part because it is based on a partial climate model that focuses on 
the equatorial Pacific alone. Cane further maintains that the trend over 
the past century has been towards warmer temperatures in the western 
Pacific relative to those in the east. That opens the door, he says, to the 
possibility that warming from greenhouse gases is driving La Nifia-like 
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conditions and could continue to do so in the future, helping to suppress 
global warming. “If all of that is true, it’s a negative feedback, and if we 
don't capture it in our models they will overstate the warming,” he says. 

There are two potential holes in his assessment. First, the histori- 
cal ocean-temperature data are notoriously imprecise, leading many 
researchers to dispute Cane’ assertion that the equatorial Pacific shifted 
towards a more La Nifia-like state during the past century’®. Second, many 
researchers have found the opposite pattern in simulations with full cli- 
mate models, which factor in the suite of atmospheric and oceanic interac- 
tions beyond the equatorial Pacific. These tend to reveal a trend towards 
more El Nifio-like conditions as a result of global warming. The difference 
seems to lie, in part, in how warming influences evaporation in areas of 
the Pacific, according to Trenberth. He says the models suggest that global 
warming has a greater impact on temperatures in the relatively cool east, 
because the increase in evaporation adds water vapour to the atmosphere 
there and enhances atmospheric warming; this effect is weaker in the 
warmer western Pacific, where the air is already saturated with moisture. 

Scientists may get to test their theories soon enough. At present, 
strong tropical trade winds are pushing ever more warm water west- 
ward towards Indonesia, fuelling storms such as November's Typhoon 
Haiyan, and nudging up sea levels in the western Pacific; they are now 
roughly 20 centimetres higher than those in the eastern Pacific. Sooner 
or later, the trend will inevitably reverse. “You can't keep piling up warm 
water in the western Pacific; Trenberth says. “At some point, the water 
will get so high that it just sloshes back.” And when that happens, if 
scientists are on the right track, the missing heat will reappear and tem- 
peratures will spike once again. m SEE EDITORIAL P.261 


Jeff Tollefson covers climate, energy and the environment for Nature. 
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<— 
ACCRETION POWER 


At the centre of every 
large galaxy lives a giant 
black hole that swallows 
gas or dust clouds 

that stray too close. As 
matter spirals inwards, 
it is compressed into an 
accretion disk. By the 
time it falls into the black 
hole, the matter is so hot 
that much of its mass 

is converted to energy, 
which emerges as heat, 
light and jets of high- 
energy particles. 


any of the astronomers and physicists invited 
to the meeting feared for their safety. Others 
felt that the event should be cancelled out- 
right. To hold a conference in Dallas, Texas, 
only weeks after US President John E Kennedy had been 
assassinated there — it just seemed disrespectful. 

In the end, the first Texas Symposium on Relativis- 
tic Astrophysics went ahead as scheduled, starting on 
16 December 1963, and most of the invited scientists did 
go — after the mayor of Dallas sent them a telegram urging 
their attendance. But the shadow cast by Kennedy’s death 
added to the already surreal mood as they grappled with a 
phenomenon that seemed unfathomable. 

That year, observers had discovered that a collection of 
mysterious ‘quasi-stellar’ objects, dubbed quasars, were not 
just oddball versions of ordinary stars. They were cosmi- 
cally distant, glowing with radiation that had travelled for 
billions of years to reach Earth. They were prodigiously 
bright, able to outshine 100 galaxies containing billions of 
normal stars. And they were astonishingly small for such 
bright objects — no bigger than our own Solar System. The 
presence of so much energy in so small a volume would 
bend space-time, as described by Albert Einstein's general 
theory of relativity, and might even cause the matter there 
to collapse into a gigantic black hole: an exotic possibility 
that at the time seemed like pure science fiction. 

“Quasars really changed everything,” says Michael 
Turner, a cosmologist at the University of Chicago, Illinois, 
who gave a speech commemorating the 50th anniversary 
of that inaugural meeting last month at the 27th Texas sym- 
posium, again in Dallas. Einstein's theory, which until the 
1960s had been considered a niche idea with little to do with 
practical astronomy, was pushed to the fore. “The floodgates 


had opened,” says Turner: observations soon proved that 
the Universe was stranger and more violent than astrono- 
mers had ever imagined. Explosions and eruptions were 
commonplace. And Solar System-sized black holes with 
masses measured in millions or billions of Suns turned out 
to lie not just inside quasars, but at the centre of every large 
galaxy in the cosmos — including our own. 

As last month’s symposium made clear, giant black 
holes still pose many puzzles, ranging from how they 
produce and release enormous amounts of energy to how 
they grew rapidly in the early Universe. Researchers are 
now starting to glean important clues from instruments 
including NASA’s Nuclear Spectroscopic Telescope Array 
(NuSTAR), which was launched in mid-2012 as the first 
spacecraft dedicated to studying these objects. And this 
year astronomers will get a rare chance to study the eating 
habits of the black hole at the centre of our own Galaxy, 
when it feasts on a cloud of gas set to stray too close to its 
gravitational trap. 

The basics of black holes’ energy production are now 
well established (see ‘Accretion power’). Stars, gas and 
dust moving through the core of a galaxy get pulled in and 
compressed by the black hole’s gravity, growing hotter and 
hotter as they spiral inwards, forming an accretion disk. By 
the time the superheated material approaches a spinning 
black hole’s event horizon — the point of no return, beyond 
which even light cannot escape — up to 42% of its mass has 
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been converted to energy. 

That energy emerges in the form of heat, light and, often, 
jets of high-speed particles that rocket in opposite direc- 
tions perpendicular to the accretion disk. These jets can 
extend for thousands or even millions of parsecs. If one 
happens to be aimed directly at Earth, astronomers see 
the object as a quasar. If the jets point sideways instead, 
astronomers see the object as a galaxy with a very bright 
‘active galactic nucleus. And if the black hole’s food supply 
is somehow restricted, so that it accretes very little gas and 
dust, the object is effectively invisible. 

Within that general picture, however, the details can be 
perplexing. Starting in 2006, for example, several sky sur- 
veys began to indicate that jets were emerging from their 
parent black holes with three times more energy than was 
contained in the original fuel, producing what seemed to 
bea gross violation of the conservation of energy. 


MAGNETIC BOOST 


At last month’s conference, physicist Roger Blandford 
of Stanford University in California described a pos- 
sible solution based on simulations of jet formation’”. 
He and his colleagues imagine a rapidly spinning black 
hole with a strong magnetic field, properties that are dif- 
ficult to detect directly but are theoretically plausible. 
The lines of the magnetic field are assumed to go out to 
great distances, threading through the accretion disk like 
stiff wires and dragging the disk’s gas along with them as 
they rotate. The simulations show that under the right 
circumstances, the magnetic field can transfer enough of 
the black hole’s rotational energy into the disk to power 
the anomalously strong jets. 

NuSTAR recently made the first 
definitive measurement of a super- 
massive black hole, revealing that it is 
spinning very fast indeed. This work 
was prompted by simulations that 
suggested a way to gauge the rotation 
of a black hole using X-rays emitted from near the event 
horizon. Rapidly spinning black holes should pull material 
closer to that horizon and subject it to intense gravity that 
would shift escaping X-rays to redder, less energetic wave- 
lengths. 

Although astronomers had seen hints of this gravita- 
tional imprint with earlier X-ray telescopes, they could not 
rule out the possibility that gas clouds were blanketing the 
accretion disk and confounding the result. But NuSTAR is 
sensitive to X-rays that have ten times higher energies than 
its predecessors could measure, and that punch through 
any such clouds. At the December meeting, NuSTAR chief 
scientist Fiona Harrison, an astronomer at the California 
Institute of Technology in Pasadena, reported seeing a clear 
signal of red-shifted X-rays from a relatively nearby spiral 
galaxy known as NGC 1365. Taken together with meas- 
urements at lower X-ray energies made by the European 
Space Agency’s XMM-Newton satellite, the observations 
showed that NGC 1365's central black hole was spinning at 
nearly the maximum rate allowed by Einstein’s theory’. It 
had enough rotational energy to tear apart its entire home 

galaxy, if that energy could somehow 


> NATURE.COM be unleashed. 

Read more about the NGC 1365 may not be typical. But 
Milky Way’s black as NuSTAR and future spacecraft 
hole: begin to measure black-hole spins 
go.nature.com/hm8wr9 further back in time, Harrison says, 
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the data may shed light on another conundrum. Astrono- 
mers have found quasars that are powered by billion-solar- 
mass black holes dating back to some 750 million years 
after the Big Bang, when the Universe was less than 6% of 
its current age. How did they get so big so fast? 

A black hole’s spin rate may be a kind of fossil trace 
of its formation, Harrison explains. Supermassive black 
holes are too big to have been formed by a star collapsing 
under its own gravity, like stellar-mass black holes. If the 
giant black holes were built from many smaller ones, each 
merger would have brought together black holes spinning 
in random directions. After millions or 
billions of years of such collisions, the 
full-grown beast would have a net spin 
close to zero. But if the giant black hole 
had been built by the merger of just a few 
medium-sized objects, the growth could 
have been quicker, the spins would not necessarily have 
cancelled one another out, and the net rotation could be 
quite high. 

The near-maximum spin of the black hole in NGC 1365 
suggests that at least some supermassive black holes grew 
through rapid mergers — although that still leaves the 
question of where the original medium-sized black holes 
came from. 


FAST SPIN, SLOW GROWTH 


Yet high spin could be a problem for black-hole growth in 
the early Universe, says Avi Loeb of the Harvard-Smith- 
sonian Center for Astrophysics in Cambridge, Massachu- 
setts. A rapidly rotating black hole tends to drag the inner 
edge of the accretion disk along with it, pulling it inwards, 
so the infalling matter has to trace out a longer, slower spi- 
ral to reach the event horizon than it would if the black hole 
were spinning slowly. And that provides more time for its 
mass to be converted into radiation instead of adding to 
the hole’s mass. 

It is conceivable that strong magnetic fields came to the 
rescue, says Loeb. By transferring the black hole’s rotational 
energy to the outer disk, they could quickly slow its spin, 
allow more matter to dive inwards and help the earliest 
black holes to pack on mass. If that is so, then future meas- 
urements will show that supermassive black holes have 
relatively modest spins. 

But Loeb’s favourite model for how black holes could 
grow in a hurry involves episodes in which the monster 
gorges itself on a stream of material so dense and opaque 
that photons do not have enough time to leak out before the 
gas makes its final plunge. The radiation is carried inwards 
instead of escaping, and the black hole swallows its energy 
as extra mass’. 

Sometimes, a strong magnetic field can stunt a black 
hole, rather than helping it to grow. That could be what is 
happening to Earth's closest giant black hole, Sagittarius 
A*, which lies just 8,300 parsecs from Earth at the heart of 
the Milky Way. As such objects go, our local specimen is on 
the small side, with a mass of only four million Suns. And 
its emissions are minimal. 

The question is, why? It may simply be that there is not 
much gas and dust in the Milky Way’s centre for the black 
hole to swallow. Or maybe something else is at work, says 
Mitchell Begelman, an astrophysicist at the University of 
Colorado Boulder. “There is a lot of interesting specula- 
tion that some accretion flows are ‘magnetically arrested,” 
he says. 
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Last year, for example, NuSTAR discovered a magnetar 
— ahighly magnetized neutron star — in an orbit close 
enough to Sagittarius A* for astronomers to use it to probe 
the black hole’s magnetic field. A close examination of the 
magnetar’s radio emissions shows that the magnetic field 
surrounding Sagittarius A* is both sizable and highly 
ordered’ — perhaps enough to block the black hole’s food 
supply and put it on a near-starvation diet. 

Our black hole does occasionally get a little nourish- 
ment. Observers are hoping to watch what happens this 
March, when a distended object called G2 is predicted to 


AS SUEPANEAVVA PLASM AME Si fab 


come dangerously close to Sagittarius A*. The object, either 
a gas cloud ora star with a distended gaseous envelope, will 
be torn apart by the black hole’s gravitational tidal forces. If 
it is gas, the resulting fireworks could be spectacular. But 
if G2 isa star, the chances of fireworks will be slimmer: it 
will keep a firmer grip on the gas and less material will fall 
in, says Andrea Ghez, an astronomer at the University of 
California, Los Angeles (see Nature 495, 296-298; 2013). 

Either way, astronomers should get a better under- 
standing of what really happens when something falls 
into a giant black hole. And they may well have a preview 
in the next few months. In observations unveiled at the 
Texas meeting, NuSTAR showed that the neighbourhood 
of Sagittarius A* contains an assortment of small, stellar- 
sized black holes and neutron stars. 

“It’s a rare treat that we've been given,’ says astrophysicist 
Zoltan Haiman of Columbia University in New York City, 
who has helped to carry out simulations which suggest that 
G2’s fateful journey may lead to a collision with one of the 
small black holes’. 

Sagittarius A* promises even more excitement as astron- 
omers gain new observational tools. Over the next few 
years, all 64 of the radio dishes from the Atacama Large 
Millimeter/submillimeter Array in northern Chile are 
expected to join other radio telescopes around the world 
to create an Earth-sized network. This combination could 
get an ultra-high-resolution snapshot of how the black hole 
bends radiation from objects on its far side into a thin ring, 
or shadow, around Sagittarius A*. Everyone expects the 
shape of the shadow to conform to the predictions of Ein- 
stein’s theory. But if it doesn’t — if general relativity does 
not correctly describe space-time around a black hole — 
the network could offer crucial clues about what theory 
should replace it. 

“That’s the big-picture question,” says Jonathan 
McKinney, a physicist at the University of Maryland in 
College Park. Fifty years after the first Texas symposium, 
“everyone wants to know if Einstein was right”. m 


Ron Cowen is a freelance writer based in Silver Spring, 
Maryland. 
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Time to leave GDP behind 


Gross domestic product is a misleading measure of national success. Countries 
should act now to embrace new metrics, urge Robert Costanza and colleagues. 


obert F. Kennedy once said that 
R: country’s gross domestic prod- 
uct (GDP) measures “everything 
except that which makes life worthwhile”. 
The metric was developed in the 1930s 
and 1940s amid the upheaval of the Great 
Depression and global war. Even before 
the United Nations began requiring coun- 
tries to collect data to report national GDP, 
Simon Kuznets, the metric’s chief architect, 
had warned against equating its growth 
with well-being. 
GDP measures mainly market trans- 
actions. It ignores social costs, environmental 
impacts and income inequality. Ifa business 


used GDP-style accounting, it would aim 
to maximize gross revenue — even at the 
expense of profitability, efficiency, sustain- 
ability or flexibility. That is hardly smart or 
sustainable (think Enron). Yet since the end 
of the Second World War, promoting GDP 
growth has remained the primary national 
policy goal in almost every country’. 
Meanwhile, researchers have become 
much better at meas- 
uring what actually 


For more on does make life worth- 
sustainable while. The environ- 
development goals: mental and social 


effects of GDP growth 


can be estimated, as can the effects of income 
inequality’. The psychology of human well- 
being can now be surveyed comprehensively 
and quantitatively**. A plethora of experi- 
ments has produced alternative measures of 
progress (see Supplementary Information; 
go.nature.com/bnquxn). 

The chance to dethrone GDP is now 
in sight. By 2015, the UN is scheduled to 
announce the Sustainable Development 
Goals, a set of international objectives to 
improve global well-being. Developing 
integrated measures of progress attached 
to these goals offers the global commu- 
nity the opportunity to define what 
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> sustainable well-being means, how to 
measure it and how to achieve it. Missing 
this opportunity would condone growing 
inequality and the continued destruction 
of the natural capital on which all life on 
the planet depends. 


DETHRONING GDP 

When GDP was instituted seven decades 
ago, it was a relevant signpost of progress: 
increased economic activity was credited 
with providing employment, income and 
amenities to reduce social conflict and pre- 
vent another world war. 

But the world today is very different from 
the one faced by the global leaders who 
met to plan the post-war economy in 1944 
in Bretton Woods, New Hampshire. The 
emphasis on GDP in developed countries 
now fuels social and environmental insta- 
bility. It also blinds developing countries to 
possibilities for more-sustainable models of 
development. 

Soaring economic activity has depleted 
natural resources. Much of the generated 
wealth has been unequally distributed, lead- 
ing to a host of social problems’. The phi- 
losopher John Stuart Mill noted more than 
200 years ago that, once decent living stand- 
ards were assured, human efforts should be 
directed to the pursuit of social and moral 
progress and the increase of leisure, not the 
competitive struggle for material wealth. Or 
as the economist John Kenneth Galbraith 
once observed: “To furnish a barren room 
is one thing. To continue to crowd in fur- 
niture until the foundation buckles is quite 
another.’ 

The limits of GDP are now clear. 
Increased crime rates do not raise living 
standards, but they can lift GDP by raising 
expenditures on security systems. Despite 
the destruction wrought by the Deepwater 
Horizon oil spill in 2010 and Hurricane 
Sandy in 2012, both events boosted US GDP 
because they stimulated rebuilding. 


WEIGHING THE ALTERNATIVES 

Alternative measures of progress can be 
divided into three broad groups (see Sup- 
plementary Information). Those in the first 
group adjust economic measures to reflect 
social and environmental factors. The sec- 
ond group consists of subjective measures 
of well-being drawn from surveys. The third 
group relies on weighted composite indica- 
tors of well-being including housing, life 
expectancy, leisure time and democratic 
engagement. 

Adjusted economic measures. These 
are expressed in monetary units, making 
them more readily comparable to GDP. 
Such indices consider annual income, net 
savings and wealth. Environmental costs 
and benefits (such as destroying wetlands 
or replenishing water resources) can also 


be factored in. One example is the genuine 
progress indicator (GPI). This metric is cal- 
culated by starting with personal consump- 
tion expenditures, a measure of all spending 
by individuals and a major component of 
GDP, and making more than 20 additions 
and subtractions to account for factors such 
as the value of volunteer work and the costs 
of divorce, crime and pollution’. 

Crucially, unlike other measures in the 
first group, GPI considers income distribu- 
tion. A dollar’s worth of increased income 
to a poor person boosts welfare more than 
a dollar’s worth of increased income does 
for a rich person. And a big gap between the 

richest and the poor- 


“GDPis est in a country — as 
dangerously in the United States 
inadequate and, increasingly, in 
as ameasure China and India — 
of quality of correlates with social 
life.” problems, including 


higher rates of drug 
abuse, incarceration and mistrust, and 
poorer physical and mental health’. 

These adjustments matter. A 2013 study 
comparing the GDP per capita and the GPI 
per capita of 17 countries comprising just 
over half the global population found star- 
tling divergences between the two metrics. 
The measures were highly correlated from 
1950 until about 1978, when they moved 
apart as environmental and social costs 
began to outweigh the benefits of increased 
GDP (see ‘Genuine progress flattens’). Tell- 
ingly, life satisfaction is highly correlated with 
GPI per capita, but not with GDP per capita. 

Some governments are taking this seri- 
ously. Two US states, Vermont and Mary- 
land, have in the past three years adopted 
GPI as a measure of progress and have 
implemented policies specifically aimed at 
improving it. 

Subjective measures of well-being. The 
most comprehensive of these is the World 
Values Survey (WVS), which covers about 
70 countries and includes questions about 
how satisfied people are with their lives. 
Starting in 1981, the WVS is conducted 
in ‘waves, the sixth of which is currently 
in progress. Another example is the gross 
national happiness index used in Bhutan. 
This measure uses elaborate surveys that ask 
how content people feel in nine domains: 
psychological well-being, standard of living, 
governance, health, education, community 
vitality, cultural diversity, time use and eco- 
logical diversity. 

Subjective well-being has been highly 
studied, and has even been recommended 
as the most appropriate measure of societal 
progress’. But subjective indicators are tricky 
to compare across societies and cultures. For 
example, self-reported health tracks with 
clinically reported rates of morbidity and 
mortality within countries but not across 
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them®. And people are not always aware of 
the things that contribute to their well-being. 
Few of us give credit to ecosystem services 
for water supply and storm protection, for 
example. 

Weighted composite measures of several 
indicators. A comprehensive picture of sus- 
tainable societal well-being should integrate 
subjective and objective indicators’ (see 
Supplementary Information, Figure S1), as 
these measures begin to do. One example is 
the Happy Planet Index, introduced by the 
New Economics Foundation in 2006. This 
multiplies life satisfaction by life expectancy 
and divides the product by a measure of eco- 
logical impact. 

Other indices in the third group com- 
bine a range of variables, such as income, 
housing, jobs, health, civic engagement, 
safety and life satisfaction. The Better Life 
Index, developed by the Organisation for 
Economic Co-operation and Develop- 
ment, maintains a website that allows users 
to choose how to weight variables, revealing 
how the emphasis on different variables can 
influence countries’ rankings. 

Many other experiments are under way 
(see www.wikiprogress.org). None of these 
measures is perfect, but collectively they 
offer the building blocks for something 
much better than GDP. 


WHY ARE WE STUCK? 

There is broad agreement that global society 
should strive for a high quality of life that 
is equitably shared and sustainable. Sev- 
eral groups and reports have concluded 
that GDP is dangerously inadequate as a 
measure of quality of life — including those 
published by the French government’s 2008 
Commission on the Measurement of Eco- 
nomic Performance and Social Progress”®, 
the Frederick S. Pardee Center for the Study 
of the Longer-Range Future’ and the Euro- 
pean Commission's ongoing Beyond GDP 
initiative. That conclusion was also echoed 
in ‘The Future We Want, the declaration 


GENUINE PROGRESS FLATTENS 
World GDP has soared since 1950, but a metric 
for life satisfaction called GPI has not. 
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of the 2012 Rio+20 UN Conference on 


Sustainable Development agreed to by all 
UN member states. 

Nonetheless, GDP remains entrenched. 
Vested interests are partly responsible. For- 
mer US President Bill Clinton’s small move 
towards a ‘green GDP, which factored in 
some of the environmental consequences of 
growth, was killed by the coal industry. How- 
ever, much of the problem is that no alterna- 
tive measure stands out as a clear successor. 

Creating that successor will require a sus- 
tained, transdisciplinary effort to integrate 
metrics and build consensus. One poten- 
tial vehicle for doing this is the setting up 
of the UN Sustainable Development Goals 
(SDGs), a process that is now under way 
to replace the Millennium Development 
Goals (MDGs). Established in 2000, the 
MDGs comprise eight basic targets that 
include eradicating extreme poverty and 
establishing universal primary education, 
gender equality and environmental sustain- 
ability. Currently both the MDGs and the 
suggested SDGs are only lists of goals with 
isolated indicators. But the SDG process can 
and should be expanded to include compre- 
hensive and integrated measures of sustain- 
able well-being”. 

If undertaken with sufficiently broad 
participation, the hunt for the successor to 
GDP might be completed by 2015. There are 
significant barriers to doing this, including 
bureaucratic inertia and the tendency of 
governments, academia and other groups 
to work in isolation. These barriers can be 
overcome with dedicated leadership. Cru- 
cially, people can now communicate across 
the globe with an ease unthinkable in the 
days of Bretton Woods. 


Any ‘top-down process must be supple- 
mented with a ‘bottom-up’ engagement of 
civil society that includes city and regional 
governments, non-governmental organi- 
zations, business and other parties. We 
recently formed the Alliance for Sustain- 
ability and Prosperity (www.asap4all.com) 
to do just that. This web-based ‘network of 
networks’ can communicate research about 
sustainable quality of life and the elements 
that contribute to it (see Supplementary 
Information), and so help to build consen- 
sus among the thousands of groups now 
concerned with these issues. 

The successor to GDP should bea new set 
of metrics that integrates current knowledge 
of how ecology, economics, psychology and 
sociology collectively contribute to estab- 
lishing and measuring sustainable well- 
being. The new metrics must garner broad 
support from stakeholders in the coming 
conclaves. 

It is often said that what you measure is 
what you get. Building the future we desire 
requires that we measure what we want, 
remembering that it is better to be approxi- 
mately right than precisely wrong. = 
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A lab technician scans a consumer gene-testing kit. 


The FDA is overcautious 
on consumer genomics 


AUS drug-agency clampdown is unwarranted 
without evidence of harm, say Robert C. Green 
and Nita A. Farahany. 


spit in a tube, ship it to a company and 

later log on to a website to learn what 
their DNA reveals about non-medical traits 
such as hair texture and ancestry, as well as 
whether they carry DNA variants associated 
with increased risks for diseases including 
type 2 diabetes and Alzheimer’s. More than 
500,000 people have bought such tests. 

On 22 November last year, the company 
that has performed the bulk of these tests 
received a letter from the US Food and Drug 
Administration (FDA). The FDA ordered 
23andMe, based in Mountain View, Cali- 
fornia, to “immediately discontinue mar- 
keting” its testing kit and personal genome 
services, which the agency says offer medical 
advice and so require regulatory approval. 
“Serious concerns are raised if test results are 
not adequately understood by patients or if 
incorrect test results are reported,’ the FDA 
wrote. Notably, it said, genetic results could 
drive consumers to take extreme steps, such 
as having unnecessary surgery to prevent 
cancer. Consumers might also abandon or 
alter prescribed treatments without consult- 
ing health professionals, the letter alleged. 

Two weeks after receiving the letter, 
23andMe took steps to allay the FDA's con- 
cerns. It continues to offer DNA testing 


Sz 2007, people have been able to 


and analysis, but no longer provides new 
consumers with genetic interpretations that 
relate to health. 

The FDA invoked the precautionary prin- 
ciple — acting on the basis of speculations of 
potential harm rather than reported harm. 
Although the FDA requires manufacturers 
of drugs and diagnostic devices to prove 
safety and efficacy before marketing, we feel 
that this approach is unwarranted in regu- 
lating 23andMe'’s personal genome service. 

In its earlier warning letters, the agency 
said that genomic health reporting “appears 
to meet the definition” ofa medical device. In 
its November correspondence, it states that 
23andMe’s service is a medical device. The 
FDA now claims jurisdiction over companies 
that provide health-related interpretations 
with genomic data (but not those that pro- 
vide genomic data alone). But such interpre- 
tations, particularly about common genetic 
variants, relate only indirectly to preventing 
or diagnosing disease. In this sense, it is like 
the inferences drawn about rapid weight loss 
measured by a bathroom scale. 

The FDA cannot reasonably regulate all 
such indirect information as medical devices. 
Moreover, as the court cases of Sorrell v. IMS 
Health (2011) and United States v. Caronia 
(2012) demonstrate, doing so could put FDA 
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regulations in greater tension with the First 
Amendment of the US Constitution, which 
protects the rights of individuals to receive 
information, and of ‘commercial speech’. 
Given this backdrop, the agency should avoid 
restricting consumer genomic testing unless 
faced with empirical evidence of harm. 

Certainly, there are legitimate concerns 
about 23andMe’s approach. Although the 
accuracy of the technology used is consid- 
ered to be high, there are no agreed stand- 
ards to which the company can conform for 
validating hundreds of simultaneous variant 
calls. There is also controversy about how to 
evaluate the accuracy of risks estimated using 
multiple variants or across ethnicities”’. And 
consumers might not read or fully under- 
stand the company’s clear statements that its 
tests identify only the most common genetic 
variants and cannot substitute for genetic test- 
ing ordered by physicians to assess specific 
indications, such as a family history of cancer. 

Nonetheless, as scholars who study how 
individuals respond to their own genetic 
information, we contend that the FDA’s 
precautionary approach may pose a greater 
threat to consumer health than the harms 
that it seeks to prevent. Data from more than 
5,000 participants suggest that consumer 
genomics does not provoke distress or inap- 
propriate treatment. 


EMERGING EVIDENCE 

Over the past five years, we and others have 
surveyed people who have received con- 
sumer genomics results, asking whether they 
understood them and whether the findings 
provoked distress, prompted a visit to a doc- 
tor or triggered a change in medication or 
lifestyle. 

In 2009, the Scripps Genomic Health 
Initiative (SGHI) recruited more than 
3,000 individuals from health and technol- 
ogy companies and offered subsidized test- 
ing through Navigenics (a company that is 
now owned by Life Technologies and no 
longer offers consumer testing). Surveys 
were administered before the participants 
received results, and 3 and 12 months after- 
wards. Responses from more than 2,000 par- 
ticipants showed no measurable changes in 
anxiety or psychological health*”. 

Consumers who accept subsidized test- 
ing might be less anxious than those who 
seek it out, but we found similar responses 
in the Impact of Personal Genomics (PGen) 
Study funded by the US National Institutes of 
Health (NIH) and jointly led by R.C.G. and 
health-behaviour researcher Scott Roberts. In 
2012-13, we surveyed more than 1,800 cus- 
tomers from two consumer genomics com- 
panies — Pathway Genomics and 23andMe. 
(Pathway Genomics has subsequently 
changed its business model to focus on tests 
ordered by physicians.) Preliminary data sug- 
gest that on average, customers were briefly 
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SOURCE: PRELIMINARY DATA FROM PGEN STUDY, 2012-13 


less anxious than their baseline after receiving 
their results, and never showed elevated anxi- 
ety or distress over the following year. These 
findings were consistent with those observed 
in the REVEAL (Risk Evaluation and Educa- 
tion for Alzheimer’s Disease) Study, a series 
of NIH-funded randomized trials carried out 
between 2000 and 2013. More than 700 vol- 
unteers, most of whose family members had 
been diagnosed with Alzheimer’s disease, 
underwent genetic tests assessing their own 
risk of this disease, and roughly 40% learned 
that they have increased risk. But even this 
potentially frightening disclosure caused only 
modest and transient distress®”. 

A 2010 survey by researchers at Johns 
Hopkins University in Baltimore, Maryland, 
of more than 1,000 customers of three compa- 
nies found that just over one-quarter of peo- 
ple share their results with physicians in the 
first few months after receiving them. Simi- 
lar findings were obtained by the SGHI and 
PGen Study”*. The Hopkins and PGen studies 
also found that fewer than 1% of customers 
reported altering any prescription medicines 
on the basis of their results without first con- 
sulting a physician (see “Taking actior). 

Just as patients sometimes misunderstand 
physicians during office visits, customers 
sometimes misconstrue results provided by 
consumer genomics companies. In the Hop- 
kins survey, participants were asked to inter- 
pret hypothetical results. Between 5% and 
9% interpreted straightforward messages 
incorrectly — stating that results showing 
increased risk instead indicated decreased 
or equal risk, or vice versa. 

The FDA is particularly concerned about 
how people might respond to learning that 
they have BRCA gene mutations that increase 
their risk of breast and ovarian cancers. Early 
evidence suggests that consumers respond 
appropriately. In a study carried out by 
23andMe, the company sent interview invi- 
tations to 136 customers who carried patho- 
genic mutations’. Of the 32 who accepted, 
14 men and 11 women had learned for the 
first time through consumer testing that 


TAKING ACTION 


After receiving genomics results, 42% of 
1,051 surveyed people reported positive 
changes in their health behaviour. Only 1% 
of all respondents altered a prescription 
treatment without consulting a doctor. 


Changes* 


42% 
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they carried a high-risk BRCA mutation. 
All of the women had consulted health-care 
professionals with their results, and all but one 
(who elected for surveillance and not surgery) 
had their tests repeated. Their behaviour after 
learning this information was no different 
from that of people who discover these muta- 
tions through medical channels. 

These interviews also revealed that 
30 family members of those carrying a 
pathogenic mutation decided to get tested 
themselves; 13 of them tested positive for the 
high-risk mutation and so received poten- 
tially life-saving information they might not 
have obtained otherwise. 

Clearly this is a very small study and con- 
ducted by an interested party. It provides no 
information about the majority of BRCA-pos- 
itive customers, who were not interviewed. 
Nor did the study evaluate whether women 
with undetected mutations and strong fam- 
ily history of breast cancer might have been 
falsely reassured (despite clear company mes- 
sages that their BRCA testing is not compre- 
hensive) and, as a result, mistakenly elected 
not to pursue testing in a medical context. 
Obtaining a truly representative sample is 
extremely difficult because consumers who 
participate in surveys might differ from those 
who do not, and each of the surveys suggest 
that early customers of genomics services are 
wealthier and better educated than the gen- 
eral population, and more likely to be white. 

More systematic research is needed to 
assess the outcomes of consumer genomics 
testing. The PGen Study results will be availa- 
ble by the end of this year, and further surveys 
could expand this work to a much larger sam- 
ple, performing standardized follow-ups on 
consumers who receive certain high-impact 
results. However, if consumer genomics is 
halted, researchers will not be able to con- 
tinue gathering data to better assess what the 
benefits and harms could be. 


DEMOCRATIZING HEALTH CARE 
We find the FDAs precautionary approach 
to 23andMe particularly troubling because 
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*Many respondents reported more than one change, so percentages total more than 100%. 


it could presage similar actions against other 
consumer health products. In its recent 
guidance on mobile health applications, the 
EDA left open the possibility that it will reg- 
ulate as medical devices information-based 
products such as questionnaires that evalu- 
ate the risk of a heart attack or the plethora 
of fitness trackers that help people to follow 
their weight, body temperature, heart rate, 
sleep patterns and more. Many operate as 
standalone or companion software for pre- 
dicting risks including the likelihood of sleep 
disorders, seizures or heart attacks. Down- 
loads and installations of these applications 
are expected to grow from 156 million in 
2012 to 248 million in 2017 (ref. 10). 

Such consumer products could democ- 
ratize health care by enabling individuals 
to make choices that maximize their own 
health. They follow the historical trend 
of patient empowerment that brought 
informed-consent laws, access to medical 
records and now direct access to electronic 
personal health data. 

We believe that 23andMe should be more 
transparent about how accurate its genotyp- 
ing chips are, and even more forthcoming 
about the limitations of its computer algo- 
rithms used for estimations of risk. But reg- 
ulatory constraints might stifle consumer 
genomics and other emerging products that 
could make society healthier and that do 
not fit neatly into the model of physician- 
driven health care. The effects of these prod- 
ucts should be monitored but, as long as 
emerging empirical data show no evidence 
of harm, we urge the FDA to let consumer 
genomics testing proceed. m 
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BOOKS & ARTS 


People collect water from a standpipe in Bukavu, Democratic Republic of the Congo. 


DEVELOPMENT 


Liquid assets 


Margaret Catley-Carlson is invigorated by a brace of 
books on the future of world water supplies. 


on the human demand for and use of 

fresh water feature in two new publica- 
tions. David Sedlak’s Water 4.0 explores what 
has and should be done to manage water, 
with efficacy, economics and use-effective- 
ness as the main metrics. In Blue Future, 
Maude Barlow argues that it is who manages 
water that is of prime importance; here, the 
essential metric is the widely cited but not 
yet well-defined right to water. (I should 
disclose here that Barlow strongly disap- 
proves of many of the organizations that I 
am involved with; a photo of me appears in 


S weeping, yet very different, perspectives 


one of her activist videos, I am told.) 
Sedlak, co-director of the Berkeley Water 
Center at the University of California, Berke- 
ley, has contributed a gem to the growing 
shelf of books on the emerging crises sur- 
rounding water, such as the billion people 
who lack clean supplies. His is an in-depth 
technical and often political history of water 
systems with a broad central theme — tech- 
niques for water delivery and wastewater 
treatment, how they work, and what they 
cost. For example, Sedlak amply covers the 
water-related infrastructures that are most 
vulnerable to climate-change-related storms 
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Water 4.0 
DAVID SEDLAK 
Yale University Press: 2014. 


Blue Future: Protecting Water for People 
and the Planet Forever 

MAUDE BARLOW 

The New Press: 2014. 


and sea-level rise, such as sewage systems in 
coastal cities. His focus is on US cities now; 
he gets there by way of an erudite romp 
through two millennia of water and sanita- 
tion practice and technology. 

Sedlak explains that the Roman Empire's 
aqueduct system (“Water 1.0’) delivered dif- 
ferent qualities of water for different pur- 
poses, using the least clean supplies in latrines 
and the baths. North Americans today, by 
contrast, use the same very expensive clean 
water for all purposes, most of it for water- 
ing lawns and flushing toilets. Sedlak quotes 
Karl Marx’s scorn for water management in 
Victorian England: “they can find no better 
use for the excrement of four and a half mil- 
lion human beings than to contaminate the 
Thames with it at heavy expense” (Capital, 
1867). Marx admired the extensive sewage 
farms around Paris, which irrigated crops 
with the effluent — a method still practised 
round the world. We also see how bad habits 
developed in the United States: for instance, 
in 1887 the city of Chicago in Illinois reversed 
the flow of the Chicago River and sent the 
sewage to the Mississippi River. 

Sedlak is an engineer, but does not over- 
whelm with technicalities. He marshals 
chemistry, biology and microbiology to 
answer numerous pressing questions. For 
instance, is the nasty film on top of water- 
filtration systems harmful? (No.) Can we get 
endocrine disrupters out of water, and stop 
feminizing male fish? (Perhaps, over time.) 
Can the natural functions ofa very polluted 
river, such as the Yamuna as it flows through 
Delhi, be restored through treatment? (Yes, 
with time, change of habits and investment.) 

Sedlak also gives full weight to cultural 
obstacles such as a reluctance to pay for 
water. And the economics of the long past, 
the unsustainable present and the potentially 
astronomically costly future are clearly put. 
Among the revelations is the US$13,500 per 
household it could cost to repair and update 
drinking-water systems in the United States. 

The ‘must read’ chapter of Water 4.0 is 
its last. Here Sedlak explains the books title 
in a serious exploration of the decentral- 
ized delivery and wastewater options open 
to industrialized countries and emerging 
global cities such as Beijing. Sedlak ruefully 
concedes that a more likely option is even 
greater investment in today’s “Water 3.0’ 
— centralized, complex, expensive known 
technology. I wanted him to come out swing- 
ing for widespread conversion to greywater 
systems, in which water from showers and 
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sinks is recycled to flush toilets. But he is too 
wise to find a silver bullet in any solution, 
or to dismiss any out of hand. The under- 
current in this book is that the way forward 
lies in answers — from the biological to the 
sociological — that suit the local culture. 

Sedlak and Barlow agree that our world- 
wide failure to value water is at the heart of 
the problem. Both believe in community 
participation in decision-making. And both 
endorse a strong role for public investment 
in water, particularly in research and in set- 
ting guidelines. There, their paths diverge. 
Where Sedlak seeks to explain the science 
and technology, Barlow seeks to expose the 
power relationships. 

In Blue Future, Barlow, a citizens’ advo- 
cate, makes a passionate plea for the societal 
change needed to honour the United Nations’ 
2010 resolution that water and sanitation are 
a human right. The powerful few, Barlow 
argues, are blocking this transformation, 
which would benefit the powerless many. Blue 
Future expands the language of the UN right 
to encompass all uses of water, and calls for 
management by a public-trust regime under 
which all competing uses for watersheds or 
aquifers would be decided according to a 
hierarchy of use managed by public agencies. 

Barlow has published some 16 books — 
spirited critiques of issues such as water 
metering and of bodies such as the World 
Bank. In Blue Future she is also critical of 
water privatization, which she sees as cover- 
ing most forms of participation by the pri- 
vate sector, including companies that own 
no part of the resource or infrastructure 
but are contracted to carry out government 
policies. (Only Britain and Chile have privat- 
ized water itself — having sold the resource 
to private companies for onward sale.) And 
she questions commodification — that is, 
the use of water markets, price mechanisms, 
purchase by beverage or mining companies, 
sale on open markets, and leases to resource 
extractors, as well as the conversion of utili- 
ties to corporatized entities. Looking at one 
such case in Ireland, Barlow somewhat 
ingenuously suggests that a water price hike 
is not needed because Ireland has a lot of 
water. Yet payments into municipal systems 
are needed to cover costs: pipes, chemicals, 
personnel, security and energy for pumping. 

However, Barlow’s primary concern (and 
Sedlak would heartily concur) is that “most 
political leaders ... create policy decisions as 
if there were no end to water supply”. That is 
the problem in a nutshell. m 


Margaret Catley-Carlson serves on a 
dozen boards, advisory committees and 
juries focused on water and agriculture, 
including the UN Secretary-General’s 
Advisory Board on Water & Sanitation and 
the Canadian Water Network. 

e-mail: m.catley-carlson@cgiar.org 


Books in brief 


We Are Our Brains: A Neurobiography of the Brain, from the Womb 

to Alzheimer’s 

D. F Swaab (translated by Jane Hedley-Préle) SPIEGEL & GRAU (2014) 

In this tour of the human brain’s often bumpy terrain, neuroscientist 

Dick Swaab argues that most of what shapes us happens in the 

womb. His survey is comprehensive, covering fetal development, 

sexual differentiation and disorders, birth, early childhood, 

consciousness, morality, memory and conditions from autism 

to Alzheimer’s disease. The vast scope of this Dutch best-seller 

y demands concision, but Swaab manages to rope each topic and 
wrestle it to the ground without breaking into a sweat. 


Trespassing on Einstein’s Lawn: A Father, a Daughter, the Meaning 
of Nothing and the Beginning of Everything 

Amanda Gefter BANTAM (2014) 

Something can come from nothing. So found Amanda Gefter: a 
question from her father about the nature of nothing propelled her 
into science journalism. In this mix of memoir and science, Gefter 
chronicles her quest to understand the big conundrums through 
study of the physics literature and meetings with remarkable 
theoreticians from John Archibald Wheeler to Lisa Randall. Her 
journey to the insight that reality is in the eye of the beholder is wittily 
told, but the reverential tone of her starry encounters may jar. 


Andrew’s Brain: A Novel 

E. L. Doctorow RANDOM HOUSE (2014) 

A cognitive neuroscientist is talking to a psychotherapist — or is ita 
prison warden? In this spiralling, scientifically savvy narrative on the 
interplay of brain and mind, distinguished novelist E. L. Doctorow 
gives us Andrew, an academic recounting his doom-ridden life in 
snapshots. Doctorow tackles consciousness, free will and memory 
with elan. The wondrous, sometimes terrifying twists of the human 
imagination are shot through with gallows humour, thought 
experiments and even political commentary — and set to a shifting, 
propulsive rhythm reminiscent of a Philip Glass symphony. 


Piero’s Light: In Search of Piero della Francesca: A Renaissance 
; Painter and the Revolution in Art, Science and Religion 
la Larry Witham PEGASUS (2014) 


2 The unearthly power of Renaissance artist Piero della Francesca’s 
'ERO: ‘ F ape ern é 
O'S Ligne works is attributable as much to skill in spatial illusion and complex 
| 


perspective as to artistic brilliance, Larry Witham shows. This study 
of Piero’s impact reveals a mathematician and geometer who 
helped to bridge the way to Galileo. Along with paintings such as 
The Flagellation of Christ — which fuse maths, classical Platonic 
science and innovative handling of light — Piero authored influential 
treatises such the Little Book on Five Regular Solids. 


Uncharted: Big Data as a Lens on Human Culture 

Erez Aiden and Jean-Baptiste Michel RNERHEAD BOOKS (2013) 

Thanks to Google Books’ digitization of millions of texts dating back 
centuries, big data is now long data. Erez Aiden and Jean-Baptiste 
Michel mine the riches using “culturomics”, quantifying history by 
graphing the occurrence of concepts and words in texts over time. In 
this lively overview, the authors reveal how the robotic historian that 
they created — the Google Ngram Viewer — has since 2010 been 
churning out analyses of everything from the efficacy of government 
censorship to the speed at which society learns. Barbara Kiser 
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Dark Universe shows dark matter as a web in space, lit up by yellow clusters of galaxies. 


ASTRONOMY 


The great unseen 


Eric Hand views a planetarium show on dark matter 
and dark energy that is both dislocating and transfixing. 


twinkling, time-lapse whorl that puts 
Ax at the centre of the Universe, and 

a comforting narrative to take you 
from Earth to outer space in stages — from 
the known to the unknown. Such is the old- 
school planetarium experience. But Dark 
Universe, the first new show for four years at 
the American Museum of Natural History's 
Hayden Planetarium in New York City, dares 
to turn the whole thing on its head. 

The 23-minute show dislocates the viewer 
by starting from a disembodied vantage 
point in the black of outer space. Smudges 
of light streak by. “Way out here, 10 million 
light years from planet Earth, every point of 
light is a galaxy containing billions of stars,” 


Dark Universe 
Hayden Planetarium, 
New York City. 

Now showing. 


intones narrator Neil 
deGrasse Tyson, the 
planetarium’s direc- 
tor. Zooming in at 
warp speed, the viewer 
stumbles on the Milky Way, the Sun and 
Earth, crashing through the planet’s atmos- 
phere to land at the 2.5-metre telescope on 
Mount Wilson near Los Angeles, California, 
where Edwin Hubble inferred the expansion 
of the Universe in the 1920s. The reversed 
ride is audacious and surprising. But it is the 
two main focuses of the show — dark mat- 
ter and dark energy — that really throw you. 

These are humbling, even alienating, con- 
cepts. Dark matter, the unknown stuff that 
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knits the Universe together, outnumbers the 
atoms that make up stars, planets and peo- 
ple by a ratio of nearly six to one. Midway 
through the show, in what he calls its Hallow- 
een scene, director Carter Emmart represents 
dark matter as a spooky black web against a 
dark-grey background. 

Dark energy is even more terrifying. This 
repulsive force was discovered in the late 
1990s, when far-off supernovae were found 
to be a little more distant than expected. 
Emmart has fun showing how the super- 
novae, the known 
luminosity of which 
is used to measure 
distance, explode 
with strobe-like 
regularity. Dark 
energy is accelerat- 
ing the expansion 
of the Universe, 
and threatens one day to hurl other galaxies 
beyond the view of the Milky Way. 

I would have liked Dark Universe to fast- 
forward to this depressing but inevitable 
conclusion, but it does not. The Hayden’s 
shows do not shy away from facts — a previ- 
ous one depicted the Sun, bloated in old age, 
encroaching ona scorched Earth — but this 
particular consequence of dark energy may 
have been a fact too far. “We don’t want to 
scare our audience,’ says Emmart. Instead, 
with streaky vector lines, he tries to explain 
a difficult, subtle concept: how dark energy’s 
accelerating effect is less apparent on the 
most distant, oldest galaxies. At the time 
these galaxies emitted the light that is now 
reaching Earth, dark energy’s repulsion was 
outmatched by the force of gravity, which 
was stronger in the dense early Universe. 

The rendering of such concepts relies on 
the digital projectors lining the rim of the 
planetarium’s dome; it would have been 
impossible with the old-fashioned bulbous 
star projectors that rose from the centre of the 
room. Unfettered, Emmart takes full advan- 
tage of the third dimension. In a spectacular 
simulation, we follow the Galileo Probe into 
the intense pressures of Jupiter's atmosphere. 
There, in the planet’s ‘cold-storage locker’, 
astronomers learned about the abundances of 
primordial hydrogen isotopes that helped to 
constrain temperatures just after the Big Bang. 

Staff at the planetarium hint that its next 
show, in two or three years, could deal with 
an altogether cosier subject: the thousands 
of extrasolar planets discovered in the past 
two decades, and the astrobiological possi- 
bilities therein. More aliens, less alienation. 
In the meantime, lean back in the Hayden's 
steeply raked seats and let your cosmic insig- 
nificance sink in. The dark Universe, not the 
blue Earth, is at the centre of it all — and it is 
man-made telescopes that tell us so. = 


Eric Hand is Nature’ US news editor. 
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Polar rescue: science 
was not well served 


Following the rescue of 
passengers from the stranded 
Russian ship MV Akademik 
Shokalskiy, the expedition’s 
leader, Chris Turney, has spoken 
out about the importance of 
science as a driver for the voyage 
(Nature 505, 133; 2014). No 
nation hesitates to aid vessels in 
distress, so why have these events 
proved so controversial? 

Turney said that the “science 
case” for the voyage was 
approved by, among others, the 
Australian Antarctic Division 
(AAD). But the AAD had no 
role in assessing, endorsing or 
approving the scientific merit of 
the expedition’s research plans. 

Turney’s expedition was a 
private venture whose research 
plans were unrelated to 
Australia's national Antarctic 
science programme, which is led 
and managed by the AAD. His 
voyage aimed to “meld science 
and adventure” and included as 
many paying tourists as it did 
scientists and students (www. 
spiritofmawson.com). 

The rescue disrupted 
the science and operations 
programmes of Australia, China 
and France, who all diverted 
their ships at the request of the 
Australian Rescue Coordination 
Centre, which ran the aid 
operation. A US icebreaker was 
also diverted. The financial cost 
of the exercise is not yet known. 

Critics have questioned 
Turney’s claims regarding the 
scientific importance of the 
voyage. The trip was relatively 
brief and seemed to involve the 
collection of routine samples. 
By contrast, research projects 
supported by national polar 
programmes are multi-year, 
multinational efforts that 
rely on sophisticated bespoke 
equipment. 

Asa result, Turney’s expedition 
has sparked yet another poorly 
informed debate on climate 
science and has seen issues 
associated with independent 
Antarctic tourism get conflated 


with the conduct of Antarctic 
science. Science is invariably the 
loser in such cases. 

Nick Gales Australian Antarctic 
Division, Kingston, Tasmania, 
Australia. 
nick.gales@aad.gov.au 


Shift aims of China’s 
poorer universities 


China’s universities and research 
institutes are experiencing 

the ‘Matthew effect, whereby 
powerful institutions become 
stronger and weaker ones become 
even weaker. It is in the country’s 
long-term interest to rectify this 
by shifting the educational focus 
of less-prestigious universities to 
help domestic job seekers. 

Chinese academic institutions 
are clamping down on the 
recruitment of research staff as the 
market becomes saturated with 
domestic PhD graduates. The top- 
tier universities (known as ‘985 
Project’ institutions; see H. Zhang 
et al. Res. Policy 42, 765-775; 
2013) set the most stringent entry 
requirements, demanding high 
qualifications from leading and 
overseas institutions. 

The aim seems to be to attract 
international talent, as outlined 
in the government's Recruitment 
Program of Global Experts, with 
a view to creating several world- 
class academic universities in 
China. To this end, the leading 
universities have already received 
a large amount of financial 
and other support. At the same 
time, ‘ordinary’ universities are 
becoming less competitive as 
they lose the power to draw in 
resources and the best people. 

To restore the balance, we 
suggest that these less powerful 
universities should focus on 
offering vocational education 
programmes (see Q. Wang 
Nature 499, 381; 2013) or 
training students for high-level 
positions outside academia (see 
Nature 472, 276-279; 2011). 
Yanhong Tang, Xin Miao 
Harbin Institute of Technology, 
Harbin, China. 
xin.miao@aliyun.com 


Rapid progress in 
producing graphene 


Your discussion on developments 
in graphene research might 

give the erroneous impression 
that we are decades away from 
acommercially viable method 

of graphene production on an 
industrial scale (see Nature 503, 
327-329; 2013). 

In fact, over the past few years 
the ability to grow large tracts 
of graphene and transfer it to 
multiple substrates, as well as 
doping, patterning and other 
techniques, have been developed 
to a point at which industrial 
production is progressing 
rapidly. You mention that the 
company Graphenea produces 
15 square metres of graphene per 
year. But other companies have 
superior production capabilities: 
US-based Bluestone Global Tech, 
for example, has the capacity to 
produce 20-200 square metres of 
graphene per day (see go.nature. 
com/gja2bo). 

The production cost you quote 
of up to US$100,000 per square 
metre is actually a retail price for 
just a few square centimetres of 
material. The cost of producing it 
is in fact much lower. Production 
costs have fallen rapidly in the 
past few months at Bluestone 
Global Tech, thanks to an 
increase in production volumes 
and new transfer techniques. 
Kostya S. Novoselov* University 
of Manchester, UK. 
kostya@manchester.ac.uk 
*On behalf of 4 co-signatories (see 
go.nature.com/utk5pk for full list). 


Use multimedia in 
grant applications 


The format of grant applications 
should be updated to incorporate 
multimedia video. This would 
help researchers to convey 
complex topics to grant-review 
panels. 

If time-poor research panels 
cannot quickly grasp the scientific 
ideas presented in a paper 
application, other factors, such 
as author affiliations and track 


records, may disproportionately 
influence project rankings. 

We contend that dense 
proposals presented in a 
more accessible multimedia 
format will enhance reviewer 
comprehension, and so preserve 
the effectiveness of the merit- 
based grant-ranking system and 
boost the research benefit from 
increasingly scarce public funds. 

Researchers already routinely 
use videos to communicate 
complex scientific concepts and 
methods at conferences and in 
online journals and seminars. 
Ironically, the US National 
Institutes of Health even uses 
YouTube video presentations to 
instruct applicants on how to 
prepare paper grant applications. 
Michael R. Doran, William B. 
Lott Queensland University of 
Technology, Brisbane, Australia. 
michael.doran@qut.edu.au 
Steven E. Doran University of 
Illinois, Urbana-Champaign, USA. 


Publish on the basis 
of quality, not gender 


The publication of research 
papers should be based on quality 
and merit, so the gender balance 
of authors is not relevant in 

the same way as it might be for 
commissioned writers (see Nature 
504, 188; 2013). Neither is the 
disproportionate number of male 
reviewers evidence of gender bias. 

Having young children may 
prevent a scientist from spending 
as much time publishing, 
applying for grants and 
advancing their career as some 
of their colleagues. Because it is 
usually women who stay at home 
with their children, journals 
end up with more male authors 
on research articles. The effect 
is exacerbated in fast-moving 
fields, in which taking even a 
year out threatens to leave a 
researcher far behind. 

This means that there are 
likely to be more men in the pool 
of potential referees. 

Lukas Koube Sherman, Texas, 
USA. 
lukas. koube@gmail.com 
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River incision revisited 


A data-set compilation suggests that measurements of river erosion into rock depend on the observation timescale, casting doubt 
on whether terraces and other incised landforms faithfully record changes in climate and tectonics. SEE LETTER P.391 


ROMAN A. DIBIASE R 


he pattern and ages of relict 

landforms, such as terraces, 

that are incised by rivers provide 
the best record of changes in Earth’s 
surface elevation over millennial to 
million-year timescales. As a result, 
much work has been devoted to deter- 
mining the age of bedrock landforms’ 
and sediment deposited on terraces” 
and in caves’, to constrain the magni- 
tude and timing of river incision. On 
page 391 of this issue, Finnegan et al.* 
challenge the perception that the ages 
of relict landforms along incised bed- 
rock rivers (Fig. 1) retain a signature of 
climatic and tectonic forcing. Instead, 
the authors argue that the intermittent 
nature of bedrock river incision means 
that attempts to measure long-term 
incision rates are inherently biased 
by the timescale over which they are 
averaged. 

Bedrock rivers set the pace of land- 
scape evolution and form an important 
connection between Earth’s tectonic 
and climatic systems. As mountain 
ranges are uplifted, increases in topo- 
graphic relief cause rivers to incise 
bedrock towards a balance between 
erosion and uplift. Furthermore, the 
ability of rivers to incise rock and 
transport sediment is controlled by 
the size and frequency of floods. Such 
floods are sensitive to changes in cli- 
mate that may in turn be driven by 
tectonic processes. In recent decades, 
provocative hypotheses’ regarding the 
potential coupling of climate, tectonics and 
erosion have impelled the need for a quantita- 
tive understanding of the mechanics of river 
incision’®. 

The idea of a link between the rates of 
change in surface elevation and the measure- 
ment timescale is well known in stratigraphy; 
this bias is known as the Sadler effect, after 
its discovery’ by Earth scientist Peter Sadler. 
In the sedimentary record, depositional his- 
tory is not continuous, but is broken by gaps 
of non-deposition or erosion, owing to the 
dynamics of sedimentary processes. As a 


Figure 1 | A flight of river terraces in the Tian Shan foreland, 
China. Finnegan et al." argue that river incision rates into bedrock, 
determined from dated landforms such as these, are biased by the 
timescale over which they are measured. 


result, measurements of deposition rates over 
timescales shorter than the longest gap will 
overestimate long-term averages in a predict- 
able manner’. Might a similar bias be present 
in net erosional landscapes? 

Finnegan et al. approached this problem by 
compiling a global data set of incision records 
from dated terraces, caves and lava flows. They 
show that, in general, incision rates decrease 
with increasing measurement interval. The 
authors combed published studies for field 
sites that have multiple levels of dated incised 
landforms spanning a wide range in age, and 
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analysed records from diverse tec- 
tonic settings — including the rapidly 
uplifting middle gorge of the Indus 
River in the Himalayas and rivers 
draining the tectonically quiescent 
Appalachian Mountains in the east- 
ern United States. They found that, in 
11 of 14 locations, apparent incision 
rates increase towards the present day. 
This relationship holds across four 
orders of magnitude in landform age, 
from thousands to tens of millions of 
years, and is independent of tectonic 
setting or landform type. 

The researchers reasoned that the 
observed scaling relationship is due to 
the intermittency of river incision, and 
they employed a random-walk model 
to show this effect. In the model, as the 
time window of observation increases, 
so does the likelihood of experienc- 
ing a long period without erosion, and 
an inverse power law scaling between 
the inferred incision rate and meas- 
urement interval emerges. Finnegan 
et al. propose that the hiatuses in river 
incision occur when sediment covers 
and protects the river bed from ero- 
sion, and are analogous to gaps in the 
sedimentary record. This implies that 
erosional as well as depositional land- 
scapes are subject to an observational 
bias when deriving rates that average 
over different time spans. 

The similarity in scaling behaviour 
for sites from a wide range of tec- 
tonic settings hints at the existence of 
a common driver for the perceived 
acceleration of incision rates, but 
there are a few factors that complicate this 
interpretation. First, abandoned flights of ter- 
races and deep valleys that preserve incision 
markers are the precise landforms expected 
for landscapes undergoing an actual increase 
in erosion asa result of changes in climate or 
tectonics’. Further independent information 
from these landscapes is needed to provide 
constraints on the plausibility of the steady- 
state erosion implicitly assumed by Finnegan 
and colleagues. 

Second, although there is little argument 
that river incision occurs only intermittently, 


L. C. MALATESTA 


and that deposition in response to landslides 
can protect river beds from eroding over mil- 
lennial timescales”, it is unclear what physical 
processes drive hiatuses in incision over time- 
scales of 1 million to 10 million years. Such 
long timescales span multiple glacial-inter- 
glacial climate cycles, and may also reflect a 
connection to deep Earth processes. Perhaps 
the most intriguing implication of Finnegan 
and co-workers’ study is the idea that long- 
term rates of landscape lowering may be more 
sensitive to the frequency and magnitude of 
depositional events than to the mechanics of 
river incision into bedrock. 

If the authors’ findings hold true, a natural 
question arises: can changes in the pace of 
landscape evolution be deciphered from net 
erosional landscapes? Work on depositional 
landscapes shows that the preservation bias 
in one-dimensional stratigraphy disappears 
when the spatial distribution of both pre- 
served sediments and hiatuses within a basin 
are taken into account’. By incorporating 
a similar spatial averaging in erosional land- 
scapes’”"*, or by accounting for changes in 
hillslope lowering over time using different 
chronometers"™, it may be possible to overcome 
some of these biases. 

Finally, to understand the influence of 
tectonics, land use or climate change on ero- 
sion rates, we need a robust way to compare 
rates measured over different time intervals. 
Although this is a challenging task, charac- 
terizing the degree to which these rates are 
unsteady, by studying the processes that con- 
trol erosion and deposition, is essential for 
interpreting rates measured over different 
periods of Earth’s history and for predicting 
future change. = 


Roman A. DiBiase is in the Division of 
Geological and Planetary Sciences, California 
Institute of Technology, Pasadena, California 
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BIRD FLIGHT 
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Fly with a little flap 
from your friends 


In-air measurements of northern bald ibises flying in a V formation show that 
the birds conform to predictions for saving energy by regulating their relative 
body position and synchronizing their flapping motion. SEE LETTER P.399 


FLORIAN T. MUIJRES & 
MICHAEL H. DICKINSON 


he elegant V formations of migrating 

birds provide a picturesque harbin- 

ger of summer’s end, but why do the 
birds fly in such a precise formation? There 
are rumours that Allied bomber pilots during 
the Second World War noticed that their fuel 
economy increased when their squadrons flew 
in a V formation. Although these apocryphal 
tales have not been confirmed, the energy- 
saving benefits of formation flying have been 
reported in both civil' and military” avia- 
tion. For example, by maintaining one wing 
tip in the wake of a forward plane, a fighter 
jet can reduce its energy consumption by up 
to 18% (ref. 2). However, exploiting the ben- 
efits of formation flight is more challenging 
for birds than for fixed-wing aircraft — birds 


Tip vortex 


i Leading bird 


Trailing bird fy 


not only need to adjust their position rela- 
tive to each other, but also must synchronize 
their wingbeat patterns’. On page 399 of this 
issue, Portugal et al.‘ show that at least one bird 
species exhibits the requisite synchroniza- 
tion of body position and flapping motion to 
reduce energetic cost during migratory flight. 

The principle by which formation flight 
saves energy derives from the way wings dis- 
turb the air as they move’. To create lift, wings 
accelerate airflow over their top surface com- 
pared with their bottom surface. Thus, rela- 
tive to the still air through which they move, 
wings create a net circular flow of air that is 
directed rearward over the top surface and for- 
ward under the bottom surface. The greater 
the circulation a wing creates, the higher the 
lift it produces. At each wing tip, however, the 
circulation around the wing rolls up into a tip 
vortex, which extends backward like a tube, 


b G 
; Leading bird 


Trailing bird 


Figure 1 | Spatial synchrony. Flight creates a looping motion of air around a bird’s wings; at the wing tips 
this circulation forms a vortex, creating air movement that extends behind the bird. Airflow down the 
middle of this wake is directed downwards (the downwash; red), whereas the area outside the tip vortices 
is a region of upwash (blue). A bird flying behind another bird experiences the aerodynamic forces of the 
downwash and upwash created by the leading bird (a, side view; b, rear view). Portugal and colleagues* 
show that northern bald ibises spatially synchronize their wing movements while flying in a V formation, 
such that the trailing bird’s wing moves through the area of maximum upwash created by the leading bird. 
This results in more-efficient lift production and energy savings. 
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creating a horseshoe-shaped structure — the 
wake — that extends as the animal moves 
forward (Fig. 1a). 

Owing to the laws of conservation, the cir- 
culation around the wings is equal in strength 
to the circulation around each of the two tip 
vortices. Air flows down through the middle of 
the wake, and the steady change in momentum 
as this downwash region elongates is equal to 
the lift created by the wings. Any animal flying 
behind another should avoid the downwash 
region, which would literally push them earth- 
ward. Just outside the downwash zone, however, 
there is a small region of upwash, created by the 
circular flow of air in the tip vortices. By careful 
placement ofits own wing tip, a trailing bird can 
exploit the upwash generated by the tip vortex 
of a leading bird, thereby generating lift more 
efficiently and reducing its flight cost (Fig. 1b). 

For an aeroplane pilot, keeping one wing 
precisely within the small upwash region ofa 
leading plane's tip vortex is tricky enough, but 
for a bird the problem is further complicated 
by the flapping wings of its neighbours, which 
create tip vortices that undulate up and down. 
A bird that is following another bird must care- 
fully adjust its own flapping motion, not in per- 
fect temporal synchrony with the leader, but 
rather at a precise phase lag that tracks the tip 
vortex as it oscillates. When flying most effi- 
ciently, all the birds in a formation should flap 
with a precise metachrony (a wave-like syn- 
chrony), such that the flapping phase changes 
systematically from the leader to each bird 
down the line. Several theoretical studies*** 
have predicted how birds flying in formation 
could optimize energy savings by tuning their 
spacing and wing motion, and geese flying in 
a V formation have been observed to align 
their body positions in a way that might save 
energy’. But until now, no experimental data 
have shown that birds are capable of the precise 
adjustment of flapping phase that is necessary 
to track undulating tip vortices. 

Northern bald ibises (Geronticus eremita) 
often fly in a V formation when they migrate. 
To examine their behaviour during forma- 
tion flight, Portugal and co-workers mounted 
custom-built data loggers on 14 ibises that accu- 
rately measured the body position and flapping 
dynamics of each bird. The authors found that 
trailing birds flew so as to keep their inner wing 
in the upwash zone of the bird in front of them, 
just as predicted by theory’. This requires not 
only correct regulation of body position, but 
also proper adjustment of the flapping phase, so 
that each bird’s wing tip follows the undulating 
tip vortex of the individual in front of it. Because 
the birds occasionally shifted position within 
the formation, situations occurred in which 
trailing birds briefly flew directly behind a lead- 
ing bird. In these situations, the trailing birds 
tended to flap their wings in strict antiphase 
with the leading bird, thereby minimizing the 
negative interactions with the downwash region 
of the wake. This change in behaviour indicates 


that the ibises actively adjust their flapping 
pattern under different conditions. 

Although these findings are qualitatively 
consistent with theoretical predictions, many 
challenging questions remain. For example, 
how much energy do the birds actually save? 
The best existing evidence that V formations 
save a significant amount of energy is that peli- 
cans have a lower heart rate and show reduced 
flapping frequency when flying in formation 
compared with when flying solo’. Accurate 
measurements of metabolic rate will be cru- 
cial for a more precise understanding of the 
underlying aerodynamics of formation flight 
and for greater insight into the ecology of bird 
migration. Do ibises and other birds instinc- 
tively flap in an efficient manner when flying 
in formation, or do they learn to adjust their 
body position and wing motion because it 
‘feels easier? And if the strategy is so useful, 
why do many species of small migrating birds 
not fly in a V formation? Might the benefits 
of formation flying decrease with body size, 
or is the requisite control of body position 
and wing motion more difficult for smaller, 


ASTROPHYSICS 


faster-flapping birds? Although our under- 
standing of V formations has improved, there 
is still much to ponder when looking skyward 
on late summer days. m 
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Black hole found 
orbiting a fast rotator 


Stars of spectral type ‘Be’ are often found with neutron stars or other evolved 
analogues, but a black- hole companion has never been spotted before. Optical 
emission from a black hole’s surroundings has given it away. SEE LETTER P.378 


M. VIRGINIA MCSWAIN 


tellar-mass black holes, which are formed 
from the gravitational collapse of massive 
stars, are unusually scarce in our Universe. 

It is not yet clear whether there are fewer of 
them than expected or whether they are just 
hard to find, but either way they are deserving 
of their exotic reputation. Therefore, the dis- 
covery by Casares et al.', reported on page 378 
of this issue, of a stellar-mass black hole orbiting 
around a star dubbed MWC 656 is like find- 
ing a needle in a haystack. This black hole does 
not emit X-ray radiation — as black holes are 
expected to do — so it could be the first sign 
of a large population of ‘quiescent’ black holes. 
MWC 656 is itself interesting because it 
is surrounded by a dense outflow from the 
star’s equator caused by a combination of its 
fast rotation (the projected rotational velo- 
city’ is about 300 kilometres per second) and 
pulsations that can eject material from the 
equator’. The resulting circumstellar disk pro- 
duces spectral emission lines from hydrogen 
and other elements, meriting its classification 
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as a ‘B-emissior or ‘Be star. Fast rotation is 
a requirement in the formation of Be stars, 
and they probably acquire their high angular 
momentum during mass transfer from a mas- 
sive companion star that eventually explodes 
as a supernova. In fact, many Be stars are found 
with highly evolved companions, usually 
neutron stars that are remnants of the post- 
supernova massive stars*. A few Be stars have 
companion stars that have been stripped down 
to just their helium cores’, but such a hot object 
is ruled out in the case of MWC 656 because 
there is no significant ultraviolet-light contri- 
bution coming from anywhere other than the 
Be star. MWC 656 is the first Be star to have a 
black-hole companion detected (Fig. 1). 
Casares et al. have identified emission lines 
from helium plasma that is trapped in an 
accretion disk around the black hole, as well 
as emission from the disk around the Be star, 
that provide a robust measurement of the 
mass ratio between the star and the black hole. 
Such emission lines are notoriously difficult to 
measure: they are broad and often asymmet- 
ric, complicating the usual procedures used to 


Black hole 


Circumstellar disk - 


Be star 


—— Accretion disk 


Figure 1 | A Be star with a black-hole companion. Casares et al.' have detected a quiescent black 

hole circling a Be star by measuring optical emission from the black hole’s accretion disk and from the 
large disk around the star. As the black hole orbits the star, some material from the circumstellar disk is 
transferred to the accretion disk. However, the high angular momentum of material in the accretion disk 
inhibits it from falling into the black hole, so there will be a gap between the accretion disk and the black 


hole, and the black hole remains quiescent. 


measure the line centres, but the authors have 
taken care to reduce any underlying systematic 
errors. Taken together with the mass of the Be 
star, which is about 10-16 solar masses, the 
measured ratio implies that the black hole has 
amass of between 3.8 and 6.9 solar masses. 

In studies of stellar evolution, conventional 
wisdom tells us that stellar-mass black holes 
form during the collapse of the cores of very 
massive stars — those with masses more than 
25 times that of the Sun® — once the stars 
exhaust their fuel, and that the collapse is pos- 
sibly accompanied by a supernova. The super- 
novae that massive stars (8-25 solar masses) 
undergo are expected to produce neutron 
star remnants instead. These massive stars 
tend to form within close groups of stars, so 
binary star systems are the norm, and triple 
and quadruple systems are not unusual. The 
catastrophe of a supernova in a binary has dra- 
matic consequences: if more than half of the 
total-system mass is lost, or if ‘kick velocity 
from the explosion propels the newly formed 
supernova remnant with enough momentum, 
the remnant and companion star could fly off 
in opposite directions’. But if the companion 
star does remain gravitationally bound to the 
remnant, an X-ray binary is formed: the black 
hole or neutron star remnant interacts with the 
remaining star to produce X-ray emission. 

Theorists predict that stellar-mass black holes 
are abundant. If this is so, we should find them 
all over the Milky Way. Many of them ought 
to be bound in X-ray binaries, whereas others 
should be freely floating through space. There 
are probably tens of millions of massive stars 
in the Milky Way that could potentially col- 
lapse into black holes, but there are only about 
50 stellar-mass black holes known with good 
confidence’. X-ray studies of young star-form- 
ing regions such as the Carina Nebula, which 
might contain at least a few recent supernovae 
products, have not found any black holes’. Even 
large sky surveys are coming up with little as 
they search for the subtle brightness variations 


of stars whose light bends around a foreground 
black hole that passes in front of the star (an 
effect known as microlensing)’*. 

MWC 656 presents a rare opportunity to 
study mass transfer, angular momentum and 
accretion-disk physics around a quiescent black 
hole. Casares et al. find a hint of a hotspot on 
the black hole’s accretion disk that suggests 
that mass is pulled away from the Be star’s 
disk, crashing into the accretion disk when 
the stars make their closest approach during 
their orbit around one another. The absence 
of X-ray emission from this system is evidence 
that material is not channelled into the black 
hole; rather, it must be retained in a holding 
pattern within the accretion disk. Gas in the 
outer regions of the Be star’s disk will have high 
angular momentum, which will be transferred 
to the accretion disk during the mass transfer. 
Without an efficient mechanism to remove 
this angular momentum, accretion will be sup- 
pressed and the black hole will remain quiet. 
If there exists a larger population of Be star- 
black-hole binaries, such quiescence is prob- 
ably the rule, not the exception. Casares et al. 
have shown us a way to find them. = 
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50 Years Ago 


Space Carrier Vehicles by Oswald 
H. Lange and Richard J. Stein — 
The book begins with a useful 
conspectus of United States space 
launching missiles from the Juno 1 
to the Saturn C-5:a list of their 
achievements is included, which 
shows the Thor-Agena B to be 

well in the lead, with 39 successful 
launches before the end of 1962 

... Further subjects discussed are 
inertial guidance and control, 

the fabrication of the missiles 
(including an informative series of 
photographs of the Saturn vehicle 
under construction) ... and, finally, 
the layout and construction of 
launching sites (with photographs 
of the Saturn launch complex at 
Cape Kennedy) ... The book shows 
a bias in favour of German or 
American achievements: p.1 gives 
the impression that the first satellite 
was launched by the United States. 
From Nature 18 January 1964 


100 Years Ago 


An article in Engineering for 
January 9 directs attention to 

the waning supply of petroleum. 
Although a continually greater 
supply of petroleum is being 

placed on the market, this increased 
output is secured only by sinking 
more wells and boring to a greater 
depth, showing that the surface 
supply is becoming exhausted. At 
the beginning of this century the 
wells touched 1100 ft., and to-day 
the average level of the oil may be 
placed at 2000 ft. — an ominously 
rapid rate of sinking ... America, 
by reckless expenditure of her 
resources, has increased her annual 
output to 200 million barrels, yet 
the demand for oil for special 
purposes has become so great that 
the rise in price is considerable — 
so great, indeed, that competition 
with coal for ordinary purposes has 
become impossible. 

From Nature 15 January 1914 
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Those who cultivate manicured lawns curse 
‘fairy rings’ of mushrooms (pictured) and 

the rapid grass growth associated with them. 
The compound that stimulates this growth, 
2-azahypoxanthine (AHX), was isolated from 

a fungus in 2010, but Choi et al. now report in 
Angewandte Chemie that plants also produce 
it U.-H. Choi et al. Angew. Chem. Int. Edn http:// 
doi.org/f2phhs; 2014). 

The authors treated several plants 
with AHX, and observed that it was 
metabolized to a compound called 2-aza- 
8-oxohypoxanthine (AOH). They went on to 
show that both AHX and AOH are produced 
by plants, and are present in rice at levels 
similar to those of plant hormones. 

Choi et al. found that a member of the 
purine metabolic pathway is converted to AHX 
and AOH in rice, and they extracted crude 
enzymes that catalyse the reactions involved 
from rice and Arabidopsis, a model plant. They 
conclude that AHX and AOH are formed ina 
previously unknown metabolic pathway. 

Intriguingly, AOH stimulates rice growth, 
albeit not as much as AHX. Frustrated haters 
of fairy rings could perhaps take heart from 
the thought that both compounds hold 
promise for horticulture. Andrew Mitchinson 
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The tug of DNA repair 


The transcription enzyme RNA polymerase stalls at DNA lesions, hindering their 
repair. Accessory factors dislodge the enzyme by pushing it forwards, but a study 
finds that pulling it backwards may also be effective. SEE ARTICLE P.372 


IRINA ARTSIMOVITCH 


NA damage caused by genotoxic 
D agents, from solar ultraviolet light to 
free radicals generated during cel- 
lular metabolism, is unavoidable. Unless 
repaired, such damage may directly threaten 
cell survival or lead to mutations and dis- 
ease. All organisms therefore rely on repair 
systems to maintain DNA integrity, with 
many components of those systems highly 
conserved throughout evolution. In this 
issue, Epshtein et al.' (page 372) describe 
how two enzymes — an RNA polymerase 
and UvrD, a helicase — cooperate to target a 
damaged site for repair. 
RNA polymerase (RNAP) carries out 
the essential job of transcribing DNA into 
RNA. When RNAP runs into a lesion in the 


transcribed (template) strand, it backtracks 
along the DNA, but still occludes the dam- 
aged site (Fig. la, b), an effect that might be 
expected to hinder DNA repair. Unexpected 
observations’ that, in vivo, damage is prefer- 
entially repaired in transcribed rather than 
non-transcribed regions led to the discovery of 
transcription-coupled repair (TCR), a branch 
of the nucleotide-excision repair (NER) path- 
way that removes many lesions, including 
ultraviolet-induced thymine dimers’. 

NER of naked DNA in bacteria involves a 
complex of Uvr proteins: UvrA and UvrB rec- 
ognize the lesion; UvrC excises the damaged 
segment; and UvrD helicase displaces it. The 
resulting gap is then enzymatically repaired 
by two enzymes, DNA polymerase and a 
ligase. In TCR, RNAP takes over the damage- 
recognition task, but has to be removed to 
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allow the Uvr complex to access the lesion. 

Ina broadly accepted model for TCR, a pro- 
tein known as Mfd pushes the stalled RNAP 
forwards (Fig. 1c), releasing it from the DNA‘, 
and recruits UvrA’, thereby directly coupling 
transcription to repair. Intriguingly, bacterial 
strains lacking Mfd are quite resistant to dam- 
age caused by ultraviolet light’, implying that 
alternative means of repair exist. Epshtein et al. 
describe a TCR pathway of opposite polarity to 
the conventional mechanism, wherein UvrD 
pulls RNAP even further backwards, exposing 
the lesion for repair (Fig. 1d). 

The role of UvrD in NER is well established 
— it acts late in the pathway, after RNAP has 
dissociated from the DNA and the UvrABC 
proteins have processed the lesion. However, 
Epshtein and colleagues observed that, in the 
bacterium Escherichia coli, UvrD interacts with 
RNAP as frequently as the general transcrip- 
tion factors NusA and NusG. This finding 
prompted the authors to re-examine the role 
of UvrD in TCR. 

Their analysis convincingly demonstrates 
that UvrD forms a binary complex with RNAP 
and induces it to backtrack both in vivo andin 
an in vitro system. The researchers also found 
that UvrD relieves an RNAP-imposed block 
to UvrABC excision of a thymine dimer in a 
minimal NER system reconstituted in vitro. 


WALLY EBERHART/VISUALS UNLIMITED/CORBIS 


Cells lacking UvrD are highly sensitive to sev- 
eral genotoxic agents that can induce NER, 
but Epshtein et al. observed that this effect is 
reversed when the two known anti-backtrack- 
ing processes are inhibited so that RNAP can 
retreat even in the absence of UvrD. Moreover, 
the authors found that, in vitro, UvrD activities 
require an energy source (ATP nucleotides), 
consistent with the enzyme’s function as a 
motor protein. 

UvrD translocates on a single DNA strand 
to displace DNA-bound proteins, an activity 
thought to be mechanistically distinct from its 
helicase activity, which separates (melts) the 
two DNA strands’. The melted non-template 
DNA strand is a target for several transcrip- 
tional regulators. Could UvrD slide on the 
non-template DNA to push on RNAP? 

Consistent with this model, Epshtein et al. 
showed that UvrD interacts with the non- 
template strand. They then sought to identify 
amino-acid residues of UvrD and RNAP that 
make contacts between the two molecules, 
using a powerful experimental approach’ that 
allows the construction of a high-confidence 
three-dimensional map of the complex, even 
though a structure of the complex has not 
been determined. This analysis identified 
the B-subunit flap domain of RNAP as a con- 
tact site for UvrD. The mapping results place 
UvrD at the upstream end of the transcrip- 
tion bubble, the structure that forms when 
part of the DNA helix is unwound by RNAP 
(Fig. 1). This is a logical location for push- 
ing the RNAP backwards, and may explain 
the involvement of NusA — which also 
interacts with the 8 flap — in NER’. Indeed, 
Epshtein and colleagues show that NusA 
cooperates with UvrD in vitro and may favour 
backtracking in vivo. 

UvrD was first implicated in DNA repair 
more than 40 years ago (see ref. 8, for exam- 
ple), but even the most basic questions about 
its role remain matters of debate. It is not 
known whether UvrD functions as a helicase 
or a translocase to carry out its diverse func- 
tions, or as a monomer or a dimer. The current 
study does not answer any of these questions, 
and instead poses a few more; for example, 
does UvrD bind only to an RNAP stalled at 
a lesion, and how does it compete with other 
regulators that bind to the non-template 
DNA? It also calls for textbooks to be revised: 
rather than (or in addition to) displacing the 
damaged DNA segment, UvrD displaces the 
lesion-stalled RNAP. This conceptual shift will 
undoubtedly reignite interest in UvrD, which 
should, in turn, help to address the outstand- 
ing questions. 

The UvrD-dependent TCR pathway seems 
to be dominant in E. coli, but is it universally 
conserved? It relies on backtracking, a mecha- 
nism widely used by cellular RNAPs to regulate 
gene expression’. Yeast pol II, the RNAP that 
transcribes most yeast genes, is structurally 
similar to the bacterial enzyme, retreats at 
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Figure 1 | Alternative routes for transcription-coupled repair in bacteria. a, b, Ina transcription 
complex, the two DNA strands separate to form a bubble; the non-template strand (purple) is exposed 

on the surface of RNA polymerase (RNAP); the flap forms part of the RNA exit channel. When RNAP 
encounters a DNA lesion (orange star), it backtracks, threading the nascent RNA into the secondary 
channel and blocking access to the lesion. c, Forward translocation of RNAP induced by sliding of the 
Mfd protein (yellow arrow) on the double-stranded DNA releases RNAP, allowing repair and generating 
naked, intact DNA. d, Epshtein et al.’ report that backward translocation (green arrow) induced by UvrD, 
which binds to the non-template strand and the 6 flap, exposes the lesion; the transcription factor NusA 
may stabilize the UvrD--flap interaction. Following repair, the arrested RNAP must be reactivated to 
position the 3’ RNA end in the active site, or removed (for example, by Mfd). 


lesions and associates with Ssl2, a homologue 
of human XPB helicase, which has the same 
translocation polarity as UvrD and is essen- 
tial for NER. Mutations in XPB are linked to 
xeroderma pigmentosum, a human disorder 
associated with a high risk of developing skin 
cancer. 

But, unlike UvrD, XPB is thought to 
melt and load the DNA into RNAP. It also 
binds at the ‘wrong’ place, contacting the 
downstream DNA and a transcription fac- 
tor (TFIE) that occupies the upstream part 
of the would-be bubble in the transcrip- 
tion complex when it is poised to start syn- 
thesis'®. TFIIE subsequently dissociates 
from RNAP but is replaced by a NusG-like 
factor, which forms a processivity clamp” 
(a structure that enables uninterrupted 
RNA synthesis in all domains of life). NusG 
homologues bind to the non-template strand 
and would be expected to exclude XPB (or 
UvrD). Could the clamp be lost at a lesion? 
Dissecting the interplay between these and 
other regulators will require the powerful 


mapping approaches that helped Epshtein 
and colleagues to establish UvrD asa key factor 
for TCR. = 
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and important developments in biology, ranging 
from the subcellular to the organismal level, and 
including molecular mechanisms and biomedicine. 

The collection begins with a discussion of state-of- 
the-art cancer-predisposition-gene discovery. Nazneen 
Rahman shows how these genes can provide insight into 
the mechanisms of cancer causation, and how their clinical 
use has had a substantial impact on diagnosis, optimized 
management and cancer prevention. 

Next, Karl Deisseroth discusses how analysing brain 
function at the level of neural projections can bea fruitful 
approach towards understanding the basis of behaviours 
relevant to psychiatric diseases. Technical advances have 
allowed perturbation of the activity of specific components 
of neural circuits, enabling causal testing of their functions. 

Adam Kepecs and Gordon Fishell go on to consider the 
organization of interneurons, which perform a crucial role 
in regulating inhibition within neural circuits. Owing to 
their diverse morphology, connectivity and physiology, 
interneurons have by and large defied generalized 
classification. Here, the authors explore data supporting 
an organization in which the cells are defined bya 
developmentally specified set of cardinal classes. 

Sean Morrison and David Scadden highlight recent 
progress in our understanding of the haematopoietic 
stem-cell niche in bone marrow, and discuss how studying 
the involvement of the microenvironment in normal 
and disease physiology promises new approaches to treat 
haematological disorders. 

Further advances, this time in genetics and systems- 
based approaches, have resulted in a renaissance in 
mitochondrial research. Jonathan Friedman and Jodi 
Nunnari discuss how this resurgence is both redefining 
and extending our knowledge about mitochondrial 
behaviour and communication. 

In the penultimate Review, Pier Paolo Pandolfi and 
colleagues discuss the ways in which various forms of 
endogenous RNA species can interact with and alter the 
expression or stability of other RNAs. 

Finally, Robert Ross, Molly Weiner and Haifan Lin 
discuss recent studies that hint at the intriguing roles of 
piRNAs — originally considered to be germline-specific 
regulatory RNAs — in somatic cells. 

Francesca Cesari, I-han Chou, Angela K. Eggleston, Noah 
Gray, Deepa Nath, Sadaf Shadan & Magdalena Skipper 
Senior Editors 
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Realizing the promise of 
cancer predisposition genes 


Nazneen Rahman’? 


Genes in which germline mutations confer highly or moderately increased risks of cancer are called cancer predisposi- 
tion genes. More than 100 of these genes have been identified, providing important scientific insights in many areas, 
particularly the mechanisms of cancer causation. Moreover, clinical utilization of cancer predisposition genes has had a 
substantial impact on diagnosis, optimized management and prevention of cancer. The recent transformative advances in 
DNA sequencing hold the promise of many more cancer predisposition gene discoveries, and greater and broader clinical 
applications. However, there is also considerable potential for incorrect inferences and inappropriate clinical applica- 
tions. Realizing the promise of cancer predisposition genes for science and medicine will thus require careful navigation. 


enetic predisposition to cancer has been recognized for 

centuries, initially through observation of unusual familial 

clusterings of cancer. In 1866, neuroanatomist Paul Broca 
published one of the earliest reports on the subject, detailing a strik- 
ing history of breast cancer in 15 members of his wife’s family’. Broca, 
controversially for the time, proposed that this was evidence of hered- 
itary predisposition to cancer. Fifty years later, biologist Theodor 
Boveri published his visionary theory that somatic acquisition of 
“particular, incorrect chromosome combinations” underlies cancer. 
His paper was equally prophetic about inherited predisposition to 
cancer, predicting it could result from “weakened resistance against 
the action of factors that stimulate cell division”’. In 1971, mathemati- 
cal modelling of the epidemiology of retinoblastoma by geneticist 
Alfred Knudson suggested a ‘two-hit’ model whereby both alleles 
of a specific gene were required to be inactivated for retinoblastoma 
to occur’, thus echoing Boveri's predictions. In 1987, the retinoblas- 
toma predisposition gene RBI was discovered, and in hereditary 
cases one allele was mutated in the germ line with the second allele 
inactivated somatically’. 

There is no definitive definition of a cancer predisposition gene 
(CPG). For the purposes of this Review, I have restricted inclusion to 
those genes in which rare mutations confer high or moderate risks of 
cancer (greater than twofold relative risks) and to those for which at 
least 5% of individuals with the relevant mutations develop cancer. 
For most genes, both the risks and penetrance are considerably higher 
than these minimum criteria. Common variants conferring very 
small increases in risk discovered through genome-wide association 
studies (GWAS) are not included within this definition. Such variants 
are important components of the genetic architecture of cancer and 
are addressed in other reviews”’. Through extensive literature and 
database evaluations, I identified 114 CPGs that form the basis of this 
Review (Fig. 1; see Supplementary Information). 

Conventionally, cancer predisposition reviews have focused 
on select cancers and/or sets of genes. In an era of whole-genome 
sequencing, information about all known CPGs is increasingly 
desirable in both research and clinical practice. Here, I have 
aspired to integrate knowledge from three decades of research to 
provide a distillation of key CPG characteristics. I also discuss the 
exciting prospects and potential pitfalls of future discoveries and 
clinical applications. 


Discovery of CPGs 
The 114 CPGs were discovered over the past 30 years through multiple 
strategies (Fig. 2). Since 1990, at least one new CPG has been identified 
each year, peaking in 1996, when 10 CPGs were reported*"®. Genome- 
wide linkage analysis, an agnostic approach that allows tracking of dis- 
ease-associated genomic markers in high penetrance familial clusters has 
been the most successful strategy, yielding 59 CPGs, mostly during the 
1990s when the methodology became routine. Next-generation sequenc- 
ing is leading to a new crop of CPGs being discovered through genome- 
wide mutational analyses such as exome and genome sequencing”. 
The remaining genes were identified through various candidate-based 
strategies. Large numbers of candidate CPGs have been proposed and 
investigated. For most, no association with cancer predisposition has been 
found. This perceived failure led to candidate-based approaches falling 
out of favour. However, certain strategies have proved very successful 
both as stand-alone discovery methods and in facilitating linkage studies. 
Candidates pursued as surrogates of cancer predisposition — for exam- 
ple distinctive cellular phenotypes such as defective DNA repair, mosaic 
aneuploidies or telomere shortening — have contributed to the discovery 
of many CPGs”'*"””°. Genetic-pathway candidates, that is, genes selected 
because they function in similar pathways to known CPGs, have also 
yielded new predisposition genes, particularly in colorectal, breast, ovar- 
ian and endocrine cancers”’**. Candidate genes that have been chosen 
because they are somatically mutated in cancers have, perhaps surpris- 
ingly, led to the identification of only 12 CPGs”. 


Overlap of somatic and germline cancer genes 

It is interesting and instructive to consider the overlap between the known 
germline and tumour-mutated cancer genes. Currently, the COSMIC 
(Catalogue of Somatic Mutations in Cancer) database includes 468 genes 
that are somatically mutated in cancers”. Of these, 49 are also known to 
be CPGs. Conversely, 65 of the 114 CPGs are known to be somatically 
mutated. These data imply that 10% of somatically mutated cancer genes 
also confer susceptibility to cancer when mutated in the germ line, but that 
40% of germline-mutated CPGs can also contribute to oncogenesis when 
mutations occur only in tumours (Fig. 3). This apparent discrepancy is, 
at least in part, an artefact of different research approaches; it is common 
for the frequency of somatic CPG mutations to be investigated, but it is 
unusual for somatically mutated genes to be evaluated for their role in 
cancer predisposition. The latter has been exacerbated by cancer genome 
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Figure 1 | Chromosomal locations of 114 cancer predisposition genes. Gain-of-function mutations in genes that predispose carriers to cancer are shown in 
red. Loss-of-function mutations in genes that predispose carriers to cancer are shown in black. 


sequencing studies in which a key filtering step is the removal of vari- 
ants that are present in normal tissue to focus attention on the potential 
cancer-driving mutations that are present only in the tumour™. Thus, it is 
highly likely that there is an underestimate of the overlap between somatic 
cancer driver mutations and germline cancer predisposing mutations, and 
mutual interrogation of such genes could prove to be a useful approach 
for identification of new cancer genes. 


Overlap of high and low penetrance cancer-associated variants 

One notable absentee in the successful CPG discovery strategies is iden- 
tification through GWAS candidature. There are 391 known common 
variants that confer small increased risks of cancer*’. The underlying 
causal gene or mechanism has only been identified for a small minority, 
but none have been shown to be sentinels of rare and higher penetrance 
mutations in new CPGs. Furthermore, only 15 SNPs (single nucleotide 
polymorphisms) are within known CPGs and none are associated with 
cancers that occur in carriers of rare, penetrant mutations. For example, 
ATM D1853N (the only exonic cancer GWAS variant in a CPG) confers 
a protective effect against melanoma, but it is not associated with the 
cancers that occur in biallelic or monoallelic ATM mutation carriers”. 
Similarly, the SNP rs78378222 alters the polyadenylation signal of TP53 
and is associated with various cancers, but not those that typically occur 
in patients with the cancer predisposition condition Li-Fraumeni syn- 
drome, which is due to germline TP53 mutations”. Multiple variants in 
the vicinity of TERT have been associated with several different cancers, 
but again not those that occur in the multisystem disorder dyskeratosis 


congenita, which is a recessive condition due to germline exonic TERT 
mutations”. These data suggest the mechanisms underlying the associ- 
ation of rare, high penetrance alleles and common, low penetrance alleles 
with cancer are largely distinct. This differs from several other common, 
complex conditions that show considerable overlap between these com- 


ponents of the genetic architecture”. 


Characteristics of CPGs 
The 114 genes are located throughout the genome with little evidence of 
chromosomal clustering (Fig. 1). 


Inheritance and mechanisms of oncogenesis 
The inheritance pattern of cancer predisposition is varied; it is autosomal 
dominant for 65 CPGs, autosomal recessive for 28, X-linked for 4 and 
Y-linked for 1. Sixteen genes cause phenotypes in both monoallelic and 
biallelic mutation carriers, that is, they cause autosomal dominant and 
autosomal recessive conditions. For half of these, the recessive condition 
is a more severe manifestation of the dominant condition. For example, 
biallelic BRCA2, PALB2, MLH1, MSH2, MSH6 and PMS2 mutation car- 
riers have a high risk of childhood cancer, whereas monoallelic mutation 
carriers have an increased risk of adult cancers”. For other genes in which 
monoallelic and biallelic mutations cause clinical phenotypes, such as FH 
and SDHA, cancer has not been reported in biallelic mutation carriers. 
This may be because of early mortality from other causes. 

Most CPGs act as tumour suppressor genes with mutations that abro- 
gate their function, promoting oncogenesis. Only 11 genes predispose 
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to cancer through gain-of-function mutations. Several of these, such as 
RET, MET, KIT and ALK, encode kinases that are rendered constitutively 
active by cancer predisposing mutations”. Much more diversity exists in 
the types, functions and mechanisms of oncogenesis of CPGs that when 
inactivated increase the risk of cancer. Many are classic tumour suppres- 
sor genes, requiring both alleles to be inactivated, but haploinsufficiency 
and dominant-negative mechanisms also occur. For some genes, differ- 
ent mutations operate through different mechanisms and lead to distinct 
phenotypes™”. For many genes, the clinical phenotypes and cancer risks 
associated with CPG mutations are also influenced by other factors, both 
genetic and non-genetic. 


Functions of CPGs 
CPG research has directly resulted in insight into basic biological path- 
ways and gene function. Several essential genes were first isolated because 
germline mutations within them predispose cancer, as is transparent in 
their names (for example, breast cancer 1, BRCAI; or retinoblastoma 1, 
RB1), which reflect the clinical phenotypes that led to their identification. 
The functions of these genes were only subsequently elucidated, often 
directly because of research into their role in cancer predisposition. 
CPGs have a very broad range of functions. Many are ubiquitously 
expressed and participate in fundamental processes such as DNA repair 
and cell-cycle regulation. One of the enduring conundrums of cancer pre- 
disposition is why, and how, perturbation of universal cellular functions 
can cause exquisitely specific cancer phenotypes. However, some genes 
have organ-specific functions that are transparently related to the cancers 
with which they are associated. For example, mutations in SLC25A13, 
ABCB11, FAH, HMBS and UROD all lead to hepatic overload, liver cir- 
rhosis and, hence, an increased risk of hepatocellular carcinoma”. 
Meaningful evaluation of the 114 CPGs for functional associations is 
currently precluded because so many were identified owing to their func- 
tional relationships with known CPGs. Nonetheless, some noteworthy 
functional networks are emerging as important in cancer predisposition, 
in addition to well-recognized pathways such as DNA repair. Among 
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Figure 2 | Timeline of cancer predisposition gene (CPG) discovery. CPGs 
were first identified by candidate gene approaches (blue). As genome-wide 
linkage analysis (green) became increasingly routine, this led to the discovery 
of many CPGs. Genome-wide mutation analyses, such as exome and genome 
sequencing (red), are leading to the discovery of a new crop of CPGs. Since 
1990, at least 1 CPG has been identified each year, peaking in 1996 when 10 
CPGs were identified. 
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these are the SWI-SNF chromatin-remodelling pathway, which has been 
linked to rhabdoid tumours and meningiomas; the succinate dehydro- 
genase enzyme complex, which is associated with phaeochromocytoma 
and paragangliomas; and the PI(3)K-mTOR signalling pathway, which 
has links with several CPGs, including TSC1, TSC2, PTEN, STK11, FLCN, 
HRAS and TMEM127, and hence is associated with diverse cancers”. 


Cancer phenotypes 

Itis currently estimated that around 3% of cancers are due to CPG muta- 
tions, which is equivalent to more than 300,000 cancers per year world- 
wide. This is an underestimate because the contribution of known genes 
has been poorly characterized and not all genes have been identified. 
The contribution to individual cancers is highly variable. The highest 
attribution is to childhood embryonal tumours such as retinoblastoma 
and pleuropulmonary blastoma which are often due to germline muta- 
tions in RBI and DICER1, respectively”. This simplicity is not applicable 
to all childhood cancers; the embryonal kidney cancer Wilms tumour 
is associated with several CPGs and other predisposition mechanisms, 
which together account for less than 5% of cases*””. At the other end of 
the spectrum, known CPGs make a very small contribution to some adult 
cancers, such as prostate and lung cancer. However, germline CPG muta- 
tions in multiple genes predispose carriers to other adult cancers such as 
breast, colorectal, melanoma and ovarian cancer. For some, the overall 
contribution of CPGs is sizeable, with around 15% of ovarian cancers, 
about 20% of medullary thyroid cancers and more than 30% of phaeo- 
chromocytomas due to CPG mutations. 

Some CPGs preferentially predispose carriers to specific histological 
subtypes of a cancer. For example, BRCA1 is particularly associated with 
triple-negative breast cancer and serous ovarian cancer, whereas CDH1 
is particularly associated with lobular breast cancer and diffuse gastric 
cancers”, The genomic profiles of cancers arising in individuals with 
germline CPG mutations can also be distinctive; chromothripsis, which 
describes localized chromosomal shattering, is striking in medulloblasto- 
mas that occur in TP53 germline mutation carriers, and cancers in BRCA1 
or BRCA2 mutation carriers have a characteristic mutational signature 
that includes substantial numbers of deletions with overlapping micro- 


homology at the breakpoint junctions™”. 


Non-cancer phenotypes 

Clinical phenotypes in addition to cancer often occur in individuals with 
CPG mutations, with 87 CPGs being associated with non-cancer clinical 
features. These are often more discriminating and more common than 
cancer, and can be crucial to clinical diagnosis of the underlying cancer 
syndrome. The spectrum of additional clinical features is very broad. Skin 
manifestations are the most frequent and can be specific to the relevant 
CPG. They include hypopigmented and/or hyperpigmented areas, freck- 
ling, rashes, blistering, hypertrophy, skin tags, and nodules and/or lumps. 
Neurological, dysmorphic and skeletal manifestations also occur, but are 
usually nonspecific features such as microcephaly, macrocephaly, short 
stature and/or developmental delay. The proportion of CPGs associated 
with non-cancer clinical phenotypes is probably an overestimation of the 
true proportion, as identification of genes that result in a readily clinically 
recognizable phenotype is inevitably more tractable. 


Genotype-phenotype associations 

One of the illuminating outcomes of CPG research has been increased 
knowledge about the diversity of mutational mechanisms and their rela- 
tionship with phenotype. Even superficially straightforward associations 
can mask profound complexity. For example, gain-of-function germline 
HRAS missense mutations cause enhanced MAPKK and PI(3)K signal- 
ling similar to somatic mutations”. However, the spectrum of germline 
and somatic mutations differs, and germline HRAS mutations not only 
predispose to cancer but also cause a multisystem disorder called Costello 
syndrome”. This condition includes distinctive facial dysmorphism anda 
wide range of cardiac, dermatological, musculoskeletal and developmen- 
tal abnormalities. The role of HRAS in these processes, and why HRAS 
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mutations lead to such a complex phenotype, is unknown. 

The genotype-phenotype relationships of some CPGs are extraordi- 
narily intricate and hint at deep uncharted complexities in gene function. 
Most WT1 mutations predispose carriers to Wilms tumour, genitourinary 
abnormalities and renal dysfunction, the severity of which is influenced 
by the mutational type. However, intronic mutations that alter the relative 
abundance of WT1 isoforms cause a distinct condition, called Frasier 
syndrome, which includes gonadoblastoma rather than Wilms tumour, 
a focal-segmental nephropathy and severe gonadal dysgenesis, which can 
manifest as complete sex reversal”. 

TERT, which encodes telomerase, is another example. Recently, acti- 
vating promoter mutations that result in increased telomerase expres- 
sion were shown to predispose carriers to melanoma”. Monoallelic and 
biallelic, primarily missense TERT mutations cause dyskeratosis congen- 
ita, which is characterized by various physical abnormalities, pulmonary 
fibrosis, bone-marrow failure and increased incidence of acute myelog- 
enous leukaemia and squamous carcinomas of the head and neck and 
anogenital region®’. Furthermore, various common SNPs in the vicinity 
of TERT confer small risks of several cancers, including breast, colorectal, 
testicular and prostate”. The mechanisms underlying the diversity of 
TERT phenotypic associations are unknown. 

TGFBR1, which encodes a transmembrane serine/threonine kinase 
receptor, is one of the most extreme examples of genotype-phenotype 
diversity. Missense mutations in the kinase domain cause marfanoid 
vasculopathies with no increased risk of cancer. Truncating mutations 
in the same kinase domain, or missense mutations in the extracellular 
ligand-binding domain, cause a highly unusual condition called multi- 
ple self-healing squamous epithelioma. Individuals with the condition 
develop squamous-carcinoma-like locally invasive skin tumours that 
grow rapidly for a few weeks then spontaneously regress and scar”’. 


CPG cancer risks 

There is a deep and widely underappreciated complexity in the risks 
of cancer conferred by CPG mutations. A specific CPG mutation can 
confer different risks of developing different cancers. Different CPG 
mutations can confer different risks of developing a particular cancer. 
A specific CPG mutation can even confer different risks of develop- 
ing a particular cancer in different contexts. BRCA2 illustrates all of 
these scenarios. Loss-of-function BRCA2 mutations confer substan- 
tial increased lifetime risks of breast and ovarian cancer but only small 
increased lifetime risks of prostate and pancreatic cancer”. However, 
not all mutations confer the same risk, despite most being protein- 
truncating mutations predicted to result in nonsense-mediated RNA 
decay and thus to be functionally equivalent. Loss-of-function BRCA2 
mutations in the central part of the gene confer significantly higher rela- 
tive risks of ovarian cancer compared with breast cancer than mutations 
at either end”. The mechanistic basis for this highly unusual pattern 
is unknown. The degree of family history also impacts on the risk of 
cancer of BRCA2 mutations. The lifetime breast cancer risk of female 
BRCA2 mutation carriers with a strong family history is around 80%, 
but the risk is only about 45% for relatives of breast cancer cases unse- 
lected for family history’*”’. This reflects, at least in part, additional 
modifying factors within familial clusters that increase cancer risk. 
Some genetic and non-genetic modifying factors of BRCA2 and BRCA1 
cancer risks have already been identified, although it is likely there is 
still much to be discovered’”**". 

TP53 is another gene that has been known for more than 20 years to 
predispose carriers to cancer, but our knowledge of the associated cancer 
risks is still lamentably incomplete. All germline TP53 mutations are 
typically assumed to be highly penetrant, but the widely quoted cancer 
lifetime risks are derived from small series of highly selected cases®. In 
fact, there is strong evidence of high variability in the types and risks of 
cancer associated with different [P53 mutations. This is exemplified by 
TP53 R337H, which confers a modest 10% risk of adrenocortical cancer 
and is not associated with increased risks of other classic Li-Fraumeni 
syndrome cancers such as breast cancer or sarcoma™. 
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Figure 3 | Overlap between somatically mutated cancer genes and cancer 
predisposition genes (CPGs). 468 genes with somatic driver mutations in 
cancers are recorded in the COSMIC database of which 49 are also included 
within the 114 CPGs. 


Clinical utility of CPGs 

The identification of CPGs has had substantial clinical impact. Cancer is 
one of the foremost diseases for which such discoveries have transformed 
medical care in multiple areas, including cancer prevention. 


Diagnosis and patient management 

The benefits of determining whether a cancer is due to a germline CPG 
mutation are incontrovertible. As such, CPG testing has become stand- 
ard for many genes, albeit typically only in highly selected cases. From a 
patient perspective, simply having a better understanding of why their 
cancer occurred is usually highly valued. It also provides important infor- 
mation that can aid diagnosis and management, for instance whether 
surgery should be conservative or radical. Radiotherapy and chemo- 
therapy may also be altered. For example, platinum-based therapies are 
not standard treatment for breast cancer but can have utility in BRCA 
carriers****. Conversely, temozolomide is unlikely to be of benefit and 
may actually promote neoplastic progression in MSH6 mutation carri- 
ers*°*”, Identifying an underlying CPG mutation also provides impor- 
tant prognostic information; survival is significantly better for BRCA2 
mutation-positive patients with ovarian cancer but significantly worse 
for BRCA2 mutation-positive patients with prostate cancer**”. The likeli- 
hood of recurrence, a new primary and/or a second malignancy can all 
be increased in CPG mutation carriers who require ongoing review and 
consideration of tailored surveillance and/or risk-reducing interventions. 
Management of non-cancer-associated problems can also be important, 
for example certain WT1 mutations result in insidious renal dysfunction, 
which requires monitoring and early intervention. 


Targeted therapies 

There is intense activity in the development of tailored therapies for can- 
cer and strategies targeting CPGs and their constituent pathways have 
been among the most innovative and fruitful. The rationale is to harness 
knowledge of the underlying cause of cancer to identify tumour-specific 
vulnerabilities that can be therapeutically exploited. The simplest model 
is in cancers caused by gain-of-function mutations, which can be directly 
downregulated by inhibitors such as imatinib (KIT and PDGFRA), 
vandetanib (RET) and foretinib (MET) ”. Trying to switch on genes 
that have been mutationally inactivated is more challenging. A direct 
approach of using compounds that ‘read-through’ stop codons is showing 
some promise, as is gene therapy’. Inhibiting a pathway member that 
is upregulated as a result of the CPG mutation has also had success. For 
example, everolimus, an mTOR inhibitor, is now approved for treatment 
of astrocytomas in tuberous sclerosis, and vismodegib, which inhibits 
the hedgehog pathway, has been found to be beneficial for basal-cell 
nevus syndrome patients with PTCH1 mutations””*. Perhaps, the most 
innovative approach has been through inducing synthetic lethality. PARP 
inhibitors, which cause lethality in BRCA deficient tumour cells but not 
normal cells with monoallelic mutations exemplify this approach, which 
is now being pursued for other CPGs” *”. Currently, these therapies are 
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still largely being evaluated in research studies and clinical trials, but 
there is optimism that identification of CPG mutations will increasingly 
lead to personalized management for patients with cancer. 


Screening and prevention 

An important benefit of CPG testing is being able to provide information 
about cancer risks to relatives. One of the unusual characteristics of CPGs 
is their capacity to serve as a biomarker of future disease. Identifying a 
CPG mutation can provide a window of opportunity to implement sur- 
veillance and/or risk-reducing measures that mitigate or prevent cancer. 
Naturally, the type of screening is determined by the type of cancer but 
most often involves imaging to detect a lesion before it presents clini- 
cally. Sometimes, a biochemical marker of risk can be measured, such as 
catecholamines or calcitonin in individuals who are at risk of phaeochro- 
mocytoma or thyroid cancer, respectively’. The presumption is that ifa 
cancer is detected early, treatment and survival will be improved, although 
this has rarely been proven and for some cancers the available evidence 
does not suggesta benefit”. Prevention usually involves surgical removal 
of the at-risk tissue and is necessarily reserved for non-essential organs 
in individuals at very high risk, such as the stomach in CDH1 mutation 
carriers, the thyroid in RET mutation carriers and the colon in APC muta- 
tion carriers"’"”'**, Chemoprevention is an attractive strategy, but to date 
there have been few applications. A notable exception is mismatch repair 
gene mutation carriers, in whom the risk of colorectal cancer is signifi- 
cantly reduced by daily aspirin'™. Of equal value, although commanding 
much less fanfare, is the use of CPG mutation testing in identifying people 
who do not have a familial CPG mutation. Such individuals do not have 
the high cancer risk of their relatives, reducing anxiety for themselves 
and their offspring. They also typically do not require costly, intensive 
screening or preventative interventions. 


Pitfalls in CPG research and clinical practice 

The study of CPGs has led to tremendous scientific and medical advances 
of broad and lasting impact. However, the field has been hampered by 
incorrect interpretations of genetic data, which can have substantial nega- 
tive consequences. 

The first major problem is the incorrect classification of a gene as a 
CPG. There are, unfortunately, dozens of genes in widely-used databases 
such as OMIM (Online Mendelian Inheritance in Man) and HGMD (The 
Human Gene Mutation Database) that are designated CPGs but for which 
the evidence is at best uncertain. For most, interpretation of the available 
data strongly suggests the gene does not confer high or moderate risks 
of cancer. There are various reasons for these misclassifications. Until 
recently, the extent of human coding variation was poorly appreciated, 
leading to overestimation of the likely causal link between the presence 
of a gene variant and cancer in an individual. This problem is actually 
increasing with exome-sequencing studies, with many researchers failing 
to appreciate that rare coding variations, including putative deleterious 
mutations, are collectively common’. The most widespread miscon- 
ception is that the absence from the control group ofa specific rare muta- 
tion that was identified in cancer cases provides evidence of causality, 
whereas usually it merely provides additional evidence that it is rare. 

Over-extrapolation of concepts and data is a pervasive problem in CPG 
research and manifests in many ways. First, it is often presumed that ifa 
gene mutation causes one cancer that any other cancer that occurs ina 
mutation carrier is also probably attributable to that gene, whereas fre- 
quently it will be coincidental because cancer is very common. Second, 
it is frequently incorrectly assumed that because one mutation class (for 
example, truncating mutations) predisposes carriers to cancer that vari- 
ants in other classes (for example, missense mutations) are also causa- 
tive, but many will be rare, innocuous variants. Third, it is commonly 
assumed that if some genes in a pathway are CPGs then variants in other 
gene members of that pathway are de facto likely to predispose carriers to 
cancer. Fourth, it is widely thought that cancer risks of CPG mutations are 
constant and can be extrapolated from one context to another, whereas 
many factors can influence the clinical expression of a CPG mutation. 
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Finally, it is often incorrectly assumed that if a variant is shown to have 
some kind of functional impact, this proves it is pathogenic. CPGs have 
multiple complex functions and the relationship between functional aber- 
rations and clinical phenotype is typically unclear or unknown. There are 
very few CPG functional assays that have been validated as robust tests of 
clinical pathogenicity. Thus, although functional data can provide sup- 
portive evidence for pathogenicity it can very rarely serve as a substitute 
for robust genetic evidence. 

The extent to which these presumptions lead to incorrect scientific 
inferences and inappropriate clinical management depends on the spe- 
cific CPG and scenario. However, significant and unacceptable negative 
impacts can result, including unwarranted surgery in healthy individuals. 


Future opportunities and challenges 

The future for CPG research is very bright, both in terms of scientific 
discovery and clinical translation. Strong evidence from multiple sources 
indicates that more CPGs remain to be discovered. Exome and genome 
sequencing are ideally suited to their identification, although standards to 
ensure consistent, robust designation as a CPG are required. For familial 
and syndromic cancer conditions, exome sequencing methods developed 
for Mendelian disorders will probably be successful, and are already yield- 
ing new genes’”"®, As with other common, complex conditions, identifica- 
tion of non-syndromic genes will remain challenging, at least until it is 
possible to sequence, analyse and interpret data from many thousands of 
individuals. Innovative sample and analytical prioritization strategies, in 
the spirit of those used so successfully in the past will thus probably have 
high utility over the next few years’. 

It is also important to recognize that in this Review I have focused on 
germline gene mutations with high or moderate risks of cancer. Other 
components of the genetic architecture of cancer predisposition, such as 
common variants with small effects are also important and the interplay 
of different genes and variants is a topical field that will probably reveal 
new and clinically relevant insights”**'""””. It is also increasingly apparent 
that many other mechanisms are likely to play a part. One emerging area 
is the role of mosaic mutations, particularly in individuals with multiple 
cancers. Genetic and epigenetic cancer-predisposing post-zygotic events 
have been identified, for example mosaic HIF2A mutations in individuals 
with paraganglioma and H19 hypermethylation in children with bilateral 
Wilms tumour’. More recently, mosaic PPM1D mutations associ- 
ated with increased risk of ovarian and breast cancer have been reported, 
although the mechanism of cancer association is currently unclear’. 
Even though a considerable proportion of genetic predisposition to cancer 
probably resides in CPGs, the genetic architecture of cancer predisposi- 
tion includes other components, many of which may be undiscovered. 

Opportunities to use CPGs to improve management of patients should 
be vigorously pursued. This will lead to optimized, personalized care 
for mutation carriers and will probably provide insights of broader rel- 
evance to cancer, as exemplified by countless CPG-based discoveries of 
the past. The rarity of CPG mutations impedes research, and improved 
networks and registries of mutation carriers would greatly enhance 
the field. Routine integration of germline CPG testing into clinical tri- 
als will be invaluable, as will better collaborative links between somatic 
and germline cancer research. Probably the most important goal, which 
would facilitate all of the points mentioned, is to increase availability 
of CPG testing to patients with cancer. Next-generation sequencing 
makes large-scale, high-throughput CPG testing possible and afford- 
able, but the clinical infrastructure needed to appropriately deliver such 
testing requires development. Various initiatives are seeking to achieve 
this, such as the UK Mainstreaming Cancer Genetics programme 
(http://www.mcgprogramme.com). 

It is imperative that comprehensive evaluation of known CPGs in large 
patient and population series is performed so their cancer risks, clinical 
phenotypes, genotype-phenotype associations, genetic and non-genetic 
modifying factors, and contribution to cancer can be clarified. Large- 
scale, international, integrated molecular and clinical databases and analy- 
ses will greatly facilitate these endeavours. Enthusiasm for feedback of 
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incidental findings is best tempered until these data are available. Recently, 
the American College of Medical Genetics issued a policy statement rec- 
ommending that incidental findings in 24 CPGs should be returned to 
the patient, irrespective of age or specific consent"”’. This has stimulated 
intense debate about possible ethical and legal ramifications. However, 
scant attention has been given to the arguably more pressing concern of 
our insufficient knowledge about the clinical consequences of mutations 
identified opportunistically. As already discussed, there is evidence to 
suggest that the impact of incidental mutations may differ substantially 
from that of mutations detected in individuals with a clinical phenotype 
and thus more curation and more caution are required. 

That being said, the impact of CPG mutations in the general popula- 
tion is of high interest and has the potential to provide health benefits and 
opportunities for cancer prevention. It is often assumed cancer surveil- 
lance is of intrinsic value and should automatically be instituted in CPG 
mutation carriers. However, for most surveillance programmes there is 
little or no actual evidence of an improvement in outcome. The lack of 
proven efficacy and the potential risks of screening, such as overdiagnosis, 
misdiagnosis, false positives and false negatives, are rarely discussed. The 
low frequency of CPG conditions makes randomized clinical surveillance 
trials challenging. It also leads to the misguided impression that ad hoc 
screening in individual CPG families is a trivial burden. In fact, instituting 
decades of surveillance to relatives in a single family can bea very consid- 
erable financial outlay. To implement this at the population level, which 
may insidiously occur as genome sequencing becomes routine, could 
spiral into sizeable strains on the capacity and purse of health services. 
This is particularly likely if individuals with rare variants of unproven 
pathogenicity are (inappropriately) included in enhanced surveillance 
programmes, as is currently often the case. To ensure consistent, appro- 
priate and affordable management of at-risk individuals, there needs to 
be a grass-roots move away from reflex interventions and to application 
and adherence to the accepted criteria of effective screening tests’””. In 
parallel, we need to invest energy in developing carefully considered and 
evaluated strategies that maximize the benefits of identifying people who 
are at increased risk of cancer. m 
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Circuit dynamics of adaptive 
and maladaptive behaviour 


Karl Deisseroth!*" 


The recent development of technologies for investigating specific components of intact biological systems has allowed 
elucidation of the neural circuitry underlying adaptive and maladaptive behaviours. Investigators are now able to observe 
and control, with high spatio-temporal resolution, structurally defined intact pathways along which electrical activ - 
ity flows during and after the performance of complex behaviours. These investigations have revealed that control of 
projection-specific dynamics is well suited to modulating behavioural patterns that are relevant to a broad range of psy- 
chiatric diseases. Structural dynamics principles have emerged to provide diverse, unexpected and causal insights into 
the operation of intact and diseased nervous systems, linking form and function in the brain. 


long-sought goal in psychiatry is to understand the concrete 

mechanistic distinctions between adaptive, physiological behav- 

ioural states and psychiatric-disease-related behavioural states, 
which are symptomatically complex in a way that can seem inacces- 
sible to physical interpretation or precise intervention. This situation 
stands in contrast to, for example, our mechanistic understanding of 
the equally diverse symptoms of heart failure (ranging from shortness 
of breath to swelling of extremities), which can be concretely explained 
and understood using a single physical formulation with robust causal 
explanatory power: reduced pumping action of the heart muscle. Unfor- 
tunately, such causal tissue-level understanding does not yet exist in 
psychiatry’, despite pioneering genetic and pharmacological studies 
that have provided causal insight at the molecular level, and brain-wide 
observational techniques such as BOLD-fMRI (blood-oxygen-level- 
dependent functional-magnetic resonance imaging) and electroen- 
cephalography that have provided circuit-level correlates (but without 
causal information)’””. 

Although neural connections are presumed to transmit information- 
rich signalling streams, long-range projections might be relevant to neu- 
ropsychiatric symptoms through neural dynamics (how brain activity 
is managed and coordinated over time and space), as significantly as 
through circuit coding (how detailed neural information is actually 
represented). Recent advances in neuroscience have revealed that sur- 
prisingly potent and specific control of complex behaviours arises from 
modulating circuit projection dynamics, even though we have very little 
causal knowledge of information coding or representation. This Review 
will consider the technological advances that allow this insight into the 
causal underpinnings of maladaptive behaviour, in the context of the 
structural dynamics findings that have been made accessible by the 
application of new technologies. 


Clinical context 

Deep-brain stimulation (DBS) with microelectrodes has set the stage 
for real-time causal intervention in neuropsychiatric conditions. 
Microelectrodes can be placed in anatomically well-defined locations 
for region-specific neuromodulation’, and have delivered potent and 
specific elicitation or resolution of subsets of symptoms in disease states 
ranging from parkinsonism to depression® '. However, DBS mecha- 
nisms and effects have been mysterious, partly because DBS electrode 


stimulation directly causes mixed patterns of excitation and inhibition 
in diverse local cells and in axons from distant sources that are passing 
through and may be unrelated to the function of the implanted region’. 
An interesting pattern is emerging from our clinical experience of psy- 
chiatric DBS; many of the most promising targets involve electrode 
contact placement in white matter (the long-range axonal connections 
that wire brain regions together) rather than in grey matter or cell-body- 
dense regions®*. DBS targets that fit this pattern include subcallosal 
cingulate white matter for depression, the medial forebrain bundle for 
depression and anhedonia (the lack of enjoyment of normally reward- 
ing experiences), the anterior commissure for depression, zona incerta 
(ZI) or subthalamic nucleus (STN)-proximal white matter tracts for 
depression and obsessive-compulsive disorder (OCD), ventral capsule 
white matter in OCD and depression, inferior thalamic peduncle white 
matter in OCD and depression, and even the vagus nerve axon bundle 
(including afferent fibres to the solitary tract nucleus) in depression’. 
However, the mechanistic implications of this pattern are unclear. The 
brief high-frequency pulses that are typically delivered through DBS 
electrodes may simply be better suited to stimulating axons than cell 
bodies*”’, and as point sources, DBS electrodes will more efficiently 
control a tract of axons in a relatively small volume than directly control 
a larger region of cell bodies in grey matter”. 

Apart from these important technical considerations, long-range 
projection properties (whether adaptive or maladaptive) may repre- 
sent suitable final common pathways for governing psychiatry-related 
behavioural states. Although structurally defined, these features of 
brain anatomy can place natural bounds on tissue dynamics, and can 
do so independently of neural coding per se. The excitability, myeli- 
nation and conduction properties, valence (excitation, inhibition or 
modulation) and net synaptic strength of axonal connections that 
constitute a particular long-range projection in the brain (all physical 
quantities that can be set by genetics, development and plasticity) are 
suited to govern circuit-level dynamical properties that have previ- 
ously been linked to psychiatric disease, such as excitation/inhibi- 
tion (E/I) balance’*”, synchronization of activity across and within 
brain regions'*”, and extent of activity propagation through brain 
regions*””**. Until recently, testing the causal impact of specific long- 
range projections on behaviour was not possible — achievable only 
with a new experimental methodology. 


‘Department of Bioengineering, Stanford University, Stanford, California 94305, USA. Department of Psychiatry, Stanford University, Stanford, California 94305, USA. “Howard Hughes Medical 


Institute, Stanford University, Stanford, California 94305, USA. 


16 JANUARY 2014 | VOL 505 | NATURE | 309 


© 2014 Macmillan Publishers Limited. All rights reserved 


REVIEW 


Technology 

Stimulated, in part, by this clinical context, technologies that work 
with specificity at the level of circuit wiring in animal models have 
become the focus of much recent research interest in neuroscience. 
New approaches that allow interrogation of specific nervous system 
connections include projection-defined activity control, as well as pro- 
jection activity mapping and structural mapping. The outcome of this 
technology development has been the ability to control and observe 
specific connections within the intact nervous system through precise 
circuit-level measurements and interventions. 

Achieving control over cells defined by specific connectivity is 
difficult, but essential to progress along the path towards the causal 
understanding of function and dysfunction of neural circuitry. Already, 
research in some invertebrate nervous systems (within which defined 
cells can be selectively controlled by microelectrodes) has demonstrated 
the importance of knowing the wiring patterns of the cells targeted for 
control, to achieve predictable influence over the complex and sensitive 
dynamical properties of even small networks of neurons”. However, in 
mammalian systems, until very recently it was not possible to control 
the high-speed dynamics of cells defined by wiring, because even when 
directed to fibre bundles or white matter, electrodes cannot select as the 
initial direct target a particular projection defined by cell-body origin 
and trajectory. Optogenetics*”*’ (Box 1), which allows researchers to 
have direct control over cells defined by projection pattern?” ™, was 
initially applied to brain disease in a study probing DBS mechanisms”. 


It was found that amelioration of parkinsonian symptoms could be most 
robustly achieved when the light-sensitive elements were chiefly afferent 
axons rather than local cell bodies’***. Shortly after, optogenetic control 
of specific trajectory-defined projections in behaviour was achieved”, 
unexpectedly revealing endogenous inhibition of anxiety-related states 
in the amygdala, in a study using gain- and loss-of-function interven- 
tions to targeted amygdala projections. Optogenetic approaches have 
since allowed projection-defined activity control in behaviours related 
to reward, motivation, depression, social interaction, compulsions, cog- 
nition and other domains of normal and maladaptive brain function 
(discussed later). Of note, projection targeting is only one application 
of optogenetics that is distinct from other approaches such as control- 
ling distinct cell types within a region to assess the impact on physiol- 
ogy or behaviour, developing prosthetic or repair strategies, mapping 
detailed wiring patterns, and perturbing dynamics precisely during 
high-dimensional observation (recording or imaging) for population 
and state space analysis. 

Controlling activity in cells defined by projection pattern would ideally 
be complemented by the observation of native activity and structure in 
these cells (Box 1). Indicators of neural activity could in principle be used 
to selectively record the activity of a projection in vivo, for example with 
genetically encoded calcium indicators (GECIs)”, which can be intro- 
duced through focal injection of increasingly sophisticated targeting viral 
vectors””**!, Cells could be connectivity-defined for activity imaging if 
this transduction process were to occur in a wiring-dependent manner, 


BOX1 


Opsin genes carried by viral vectors are focally injected into the 
brain. Top left, a projecting population has been defined by injection 
of a retrograde viral tool to deliver an opsin or recombinase**!* into 
the upper left cell population; to provide projection specificity of the 
control, light is delivered by a fibre-optic not to the same location but 
to a distinct focal region of upstream cells (typically about 1 mm* 

in size). Top middle, in a technically different approach*?? 34125, 
microbial opsins — ideally enhanced with membrane-trafficking 
motifs from mammalian channels'**!”6 — are transported down 
axons, which become the photosensitive elements in downstream 
fibre-optic-targeted structures. In this case, a well-tolerated (chiefly 
non-retrograde) adeno-associated virus or lentiviral vector may be 
used, but weeks are needed for effective opsin transport. Inhibition 
of the projection can be achieved with an inhibitory opsin gene**”?, 
encoding a chloride or proton pump, whereas stimulation of 

cells defined by the projection can be achieved with an excitatory 


Typical projection-targeting experimental configurations 


opsin gene**!*°, encoding a cation channel (in the case of the 


latter, by using channelrhodopsins, antidromic action potential 
backpropagation from the site of light delivery may occur, which 
can confer the feature of fully recruiting cells defined as possessing 
the specified projection among other possible projections). Top 
right, recording with various modalities (activity photometry”? is 
shown here with an additional readout optical fibre, or electrical 
recording as in bottom row) at any point of interest in the brain 
allows assessment of the circuit dynamical underpinnings of the 
behavioural changes observed or photo-tagging of cells defined by 
projection. Bottom left, high-speed multi-unit electrical recording; 
spikes corresponding to cells defined by genetically determined 
opsin expression are identified through sufficiently low-latency 
spikes after a light pulse. Bottom right, same as at left except cells 
are defined for photo-tagging by projection illumination rather than 
projection transduction. Adapted with permission from ref. 127. 
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and certain rabies- or herpes-virus-based tools (although these are limited 
in utility by toxicity to target cells) will transduce axon terminals at the 
projection location”, in principle allowing the observation of activity 
in projection-defined cells. Although these upstream cells may be distrib- 
uted over large distances, activity imaging is carried out focally, thereby 
spatially defining cells that give rise to a specific projection (Box 1). 

Activity recording of projection-defined cells (although not of the 
projection itself) when animals are exhibiting a certain behaviour has 
been achieved using optogenetics itself as an identification tool to dis- 
criminate recorded cells** * based on wiring phenotype. Through this 
approach, an excitatory opsin is conditionally expressed based on pro- 
jection pattern by the introduction of the opsin-carrying axon-trans- 
ducing virus (for example, rabies or herpes) into a terminal field of the 
projection. Then pulses of light delivered to the cell-body location may 
elicit the lowest-latency (directly excited) electrical spikes from cells 
bearing the projection of interest (Box 1); the characteristic waveforms 
of these fast-responding cells can be quantitatively defined so that sub- 
sequent multielectrode recording during behaviour will allow the rec- 
ognition of spikes from these projection-defined cells**’. Although the 
specificity of this ‘tagging’ strategy is not absolute unless spike latencies 
in the low-ms range can be achieved or (more likely) details of the local 
circuitry preclude fast indirect synaptic activation”, this combination 
of optogenetics and multielectrode recording has allowed, in some 
settings, the real-time activity recording of cells defined by the projec- 
tion target (which need not be identical to the distant activity of the 
projection termination itself) during mammalian behaviour. 

Finally, to complement projection-based activity control and pro- 
jection-based activity measurement, detailed observation of the brain- 
wide physical form of the projections in question will be important. 
Visualization of neural projections linked to causality information and 
molecular descriptors will deepen our understanding of the neural 
structural dynamics that underlie behaviour, although so far pioneer- 
ing sectioning or ablative electron microscopy and array tomography 
methods**? (in some cases with correlational information on activ- 
ity patterns) are not readily linkable with causal information on the 
behavioural significance of the connections. Clearing chemicals such as 
Scale™, BABB’”**, SeeDB” and ClearT™ allow degrees of brain transpar- 
ency (although without molecular phenotyping because the resultant 
brain tissue remains largely impermeable to macromolecular labels such 
as antibodies). High-throughput projection-mapping””’ tools involve 
thin-sectioning and reconstruction by alignment, and, in an approach 
without sectioning, the electrochemical technology CLARITY allows 
the construction of crosslinked hydrogels from within tissue and sub- 
sequent electrophoretic removal of membrane lipids, thereby allow- 
ing the penetration of photons and macromolecular labels throughout 
the intact mammalian brain®™*' (Fig. 1). All of these approaches can be 
linked to causal or activity information on the projections observed 
if registration of data sets is conducted; projection activity could be 
accessible not only with acute slice approaches™ but also with in vivo 
behavioural methods because opsin or GECI probes can be integrated 
with, or include, cell-filling fluorescent labels that allow projection map- 
ping by light microscopy after the completion of behavioural testing or 
activity imaging. If needed, light-microscopy-based projection maps 
can also be linked to ultrastructural information to describe the syn- 
aptic properties and targets of the projection. 


Circuit -level findings 

Together, the technologies delineated have opened the door to deter- 
mining and understanding the causal significance of defined projec- 
tions in the control of behaviour. Here, I discuss behavioural findings 
that have emerged about the interrelationships between adaptive and 
maladaptive behaviour through experiments probing causal structural 
dynamics at cellular resolution and within the intact brain. For the pur- 
poses of emphasis on behavioural and clinical insight, I highlight the 
major psychiatric symptom domains relating to anxiety and depression. 
A complete summary is provided in Table 1. 
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Figure 1 | Visualizing projections in the intact mammalian brain. The 
entire mouse brain has been processed with CLARITY (with Thy-1 

type projection neurons labelled in green); such brain wide structural 

data sets may be integrated with other data streams from the same animal, 
including activity records during behaviour (as obtained with GECIs), stable 
activity markers such as immediate-early genes, whole-mount molecular 
phenotyping, behavioural scores in the presence or absence of optogenetic 
or disease-model interventions, and opsin—fluorophore expression patterns. 
In general, whole-brain analysis is useful for interpretation; for example, use 
with multiple retrograde labels may help to determine whether individual cells 
include collaterals projecting to multiple sites, an important consideration in 
projection targeting. Adapted with permission from ref. 61. 


Causal projection dynamics linked to anxiety 

Anxiety disorders, which include generalized anxiety disorder, panic 
disorder, post-traumatic stress disorder and OCD, represent the most 
prevalent class of psychiatric condition. Building on earlier work that 
implicates structures such as the extended amygdala in fear and anxi- 
ety, and the striatum in OCD, recent studies have made headway in our 
understanding of the causal projection dynamics of anxiety-related 
symptoms. In an emerging theme, fundamentally different results were 
often obtained when projection-defined circuit elements were recruited, 
compared with the recruitment of region-defined cell-body populations. 

In initial anxiety-related investigations”, freely moving mice were 
assessed using the elevated-plus maze and the open-field test, which 
allow inference of apprehension in the absence of immediate threat. 
When the cell bodies of excitatory (CaMKIla-expressing) neurons were 
optogenetically stimulated in the basolateral amygdala (BLA) (without 
regard to projection target), an immediate (and immediately revers- 
ible) anxiogenic effect was observed in which subjects began to avoid 
exposed areas of test arenas, without other changes in locomotor perfor- 
mance. Conversely, when these neurons were optogenetically inhibited, 
an anxiolytic effect was seen”; together, these results were consistent 
with classical anxiogenic models of the BLA. 

However, more refined projection targeting revealed unanticipated 
anxiolytic circuitry embedded within this otherwise anxiogenic BLA 
circuit environment. When only a subpopulation of the CaMKIIa- 
expressing elements — namely lateral projections from the BLA towards 
the central nucleus of the amygdala (CeA) — were stimulated instead, 
a prominent anxiolytic effect was observed”*. Conversely, inhibition of 
this projection elicited anxiogenesis”. This surprising arrangement (that 
required optogenetic projection targeting to resolve) is well-suited to pro- 
viding diverse access nodes for swift and versatile endogenous modula- 
tion”, and no doubt further anatomical and functional complexities will 
be elaborated. For example, studies of conditioned fear®*® have identified 
two functionally and genetically distinct types of unit in the centrolat- 
eral component of the CeA, and optogenetic stimulation of one of these 
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Table 1 | Projection-defined activity dynamics causally implicated in psychiatric-disease-related behaviour 


Projection origin Projection target Elicited activity in projection Behaviour References 
mPFC LHb Excitation Passive/immobile (swim test) 76 
DRN Excitation Active/mobile (swim test) 76 
nhibition Passive/immobile (swim test) 76 
NAc Excitation Appetitive 93 
BLA CeL/CeA Excitation Anxiolytic (risk) 62 
nhibition Anxiogenic (risk) 62 
adBNST Excitation Anxiolytic (risk, respiratory) 68 
nhibition Anxiogenic (risk, respiratory) 68 
NAc Excitation Appetitive 92,93 
nhibition Aversive 92 
vH Excitation Anxiogenic (risk) 121 
nhibition Anxiolytic (risk) 121 
adBNST LH Excitation Anxiolytic (risk) 68 
PB Excitation Anxiolytic (respiratory) 68 
VTA Excitation Appetitive 68 
vBNST VTA Excitation (glutamatergic) Aversive, anxiogenic 100 
VTA Excitation (GABAergic) Appetitive, anxiolytic 100 
LH Excitation (GABAergic) Increased food consumption 88 
LH Inhibition (GABAergic) Decreased food consumption 88 
VH NAc Excitation Appetitive 93 
LDT VTA Excitation Appetitive 97 
LHb VTA Excitation Aversive 97 
EPN LHb Excitation Aversive 99 
VTA TH NAc Excitation Appetitive 81 
mPFC Excitation Aversive 122 
LHb Excitation (GABAergic-TH) Appetitive 123 
ARC AgRP PVH Excitation Increased food consumption 87 
vmOFC VMS Excitation (chronic: probably favouring potentiation of projection) Increased grooming (wild-type mice) 70 
LOFC CMS Excitation (acute: probably favouring inhibitory cell targets) Reduced grooming (Sapap3-mutant 71 
OCD mouse model) 
adBNST, anterodorsal bed nucleus of the stria terminalis; ARC AgRP, agouti-related peptide cells in arcuate nucleus; BLA, basolateral amygdala; CeA, central amygdala; CeL, centro-lateral amygdala; 
CMS, centromedial striatum; DRN, dorsal raphe nucleus; EPN, entopeduncular nucleus; LDT, laterodorsal tegmentum; LH, lateral hypothalamus; LHb, lateral habenula; LOFC, lateral orbitofrontal cortex; 
mPFC, medial prefrontal cortex; NAc, nucleus accumbens; PB, parabrachial nucleus; PVH, paraventricular hypothalamus; TH, tyrosine hydroxylase; vBNST, ventral BNST; vH, ventral hippocampus; vmOFC, 
ventromedial orbitofrontal cortex; VMS, ventromedial striatum; VTA, ventral tegmental area. 


populations inhibited neurons projecting to periaqueductal grey”; this 
work illustrates that sufficiently precise genetic cell-type targeting (where 
feasible) can give rise to a kind of projection targeting, and together with 
earlier work has deepened our understanding of fear behaviour”. 
Although the initial anxiety study”* focused on risk-avoidance behav- 
iour (in the elevated-plus maze and the open-field test), the clinical 
anxious state is not characterized by risk-avoidance alone. Physiological 
phenomena such as respiratory-rate changes and aversive subjective 
sensations are an enormously important part of the clinical picture, 
contribute to morbidity and mortality and can be studied quantitatively 
in animal models. A subsequent paper™ extended the projection tar- 
geting concept in anxiety to study assembly of the anxious state from 
multiple separable features. This study found that a long-range projec- 
tion from the BLA to the bed nucleus of the stria terminalis (BNST) 
was unexpectedly able to favour anxiolysis, that within the BNST itself 
oppositional subnuclei favoured anxiogenesis or anxiolysis, and that 
distinct outgoing projections from the anxiolytic subnucleus (the anter- 
odorsal BNST, adBNST) each recruited distinct features of anxiolysis® 
(Fig. 2). Surprisingly, the BNST projection to the lateral hypothalamus 
was found to recruit only a behavioural risk-avoidance feature, the 
projection to the ventral tegmental area (VTA) was found to recruit 
only an appetitive (positive-valence conditioning or rewarding) feature, 
and the projection to the brainstem parabrachial nucleus was found 
to recruit only a reduced respiratory-rate feature (Fig. 2). Projection 
targeting allowed the determination of the causal nature of this feature 
separability, illustrating aspects of how complex behavioural states may 
be coordinately assembled and disassembled by projection dynamics. 
It is worth underscoring here that neural codes per se, in terms of the 
spiking representations of information, are not understood for any of 
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these internal states or behavioural outputs, but relatively complex and 
specific behaviours can still be elicited and suppressed using projection 
targeting of circuit dynamics. 

OCD highlights some of the challenges (and illustrates some of the 
solutions) for the study of disease-related symptoms in animal mod- 
els. The disorder is characterized both by symptoms that are difficult 
to assess in experimental animals (recurring and intrusive thought 
patterns) and by symptoms with clear rodent correlates (compulsive 
behaviours that are poorly suppressed even when adverse conse- 
quences result). As with many psychiatric diseases, insight into the 
overall syndrome may be obtained in animal models by focusing on 
those symptom domains that can be recapitulated experimentally. 
For example, circuits involving the orbitofrontal cortex (OFC) and 
striatum have previously been correlated with clinical OCD®, and 
two recent papers have now causally tested these circuits using pro- 
jection targeting in mouse models of compulsive behaviour. One 
study” involved driving cells projecting from the ventromedial OFC 
to the ventromedial striatum (VMS), and tracked grooming behav- 
iour (compulsive grooming bears some similarity to behaviours in the 
clinical condition). Repeated bursts of activation of this projection led 
to increased grooming over days, suggesting a mechanism by which 
pathological behavioural patterns subserved by that projection could 
become difficult to suppress. Another group began with a model of 
compulsive grooming (Sapap3 mutant mice) and targeted a projec- 
tion between the lateral OFC and the centromedial striatum (CMS)”’. 
Acutely driving this projection recruited feedforward inhibition in 
the CMS and gave rise to suppression of compulsive grooming. OFC 
is important for many behavioural and cognitive processes, includ- 
ing aspects of attention, reversal learning, risk responsiveness and 
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response inhibition”; together, these papers causally implicated 


specific projections from the OFC to the striatum in modulating 
compulsive grooming relevant to OCD. 


Causal projection dynamics of depression-related behaviour 

One of the core diagnostic criteria for major depressive disorder is 
the clinically defined symptom of hopelessness, a profound negativity 
about the future that can be behaviourally manifested as a predilection 
to discount the value of choices or effort in the present. Such a behav- 
ioural pattern can be pathological and give rise to severe morbidity (and 
mortality from suicide); however, when environmental conditions are 
adverse, a ‘passive-coping’ behavioural state — in which minimal energy 
is expended — could in fact be more adaptive than an active-coping 
state*’*°, A recent report described the results of modulating dynamics 
in targeted projections arising from the prefrontal cortex, in the setting 
ofa behavioural task (the automated forced swim test, FST) designed to 
detect temporally precise transitions between active and passive coping”. 

When cells with a projection between the medial prefrontal cortex 
(mPFC, an anterior forebrain structure involved in executive function 
and planning) and the dorsal raphe nucleus (DRN, a major locus of 
serotonergic neurons) were optogenetically stimulated during the auto- 
mated FST, rats were found to shift towards an active-coping behav- 
ioural regime (swimming or climbing); the same fibre-optic-targeted 
intervention did not alter nonspecific locomotor activity assessed 
in the open field”. Surprisingly, the opposite finding was seen when 
cells defined by projections from the mPFC to the lateral habenula 
(LHb) were targeted; rats shifted toward the passive-coping (floating 
or immobile) regime, illustrating the importance in optogenetics (for 
true functional specificity) of not just cell-type targeting but also projec- 
tion targeting allowed by the fibre-optic neural interface”. And when 
cells defined by projection from the mPFC to the BLA were optogeneti- 
cally recruited, no behavioural effects in this test were seen”®. It was also 
found that targeting corresponding focal cell-body regions (instead of 
defined projections) was not well-suited to elicitation of this specific 
class of behavioural effect. First, when DRN cells were driven optoge- 
netically, but nonspecifically, increased active behaviour was seen in 
the FST but also in the open field, indicating elicitation of a less-specific 
behavioural state”. Second, when mPFC cells were generally recruited 
without specifying projection class, no transition between active and 
passive behavioural state was seen, probably partly because oppositional 
projections were recruited”. 

The converse of depression-related behavioural states (with maladap- 
tively low value assigned to actions or available choices) would be states 
in which predictions are maladaptively positive. Manic and hypomanic 
behaviour may fall into this category in the form of the core criteria of 
increased goal-directed and risk-taking behaviour (along with certain 
character traits and behavioural patterns such as pathological gam- 
bling, especially those associated with dopamine (DA)-system modu- 
lation’””* in which perception of risk”? is impaired and/or perceived 
likelihood of positive outcomes is inflated). In a transgenic rat line that 
allowed opsin expression only in tyrosine hydroxylase (TH)-expressing 
cells® in the VTA (that is, DA neurons), stimulation of these cells was 
recently found to favour the active-coping behavioural pattern in the 
automated FST without nonspecific locomotion effects in the open 
field’. Electrical recording in the anatomical target of one of these out- 
going projections (the nucleus accumbens, NAc) from multiple single 
units during the automated FST revealed that modulating DA neuron 
dynamics altered key aspects of the neural representation of action in 
the NAc”*. This particular projection (VTA-NAc) may be clinically rel- 
evant because recent work has suggested NAc-related projections are 
significant in treating patients with depression using DBS’. It will 
be interesting to investigate possible anatomical and causal linkages 
between the mPFC-DRN” and VTA-NAc”’ pathways. 

The vegetative symptoms of psychiatric disease are those linked to 
basic physiological functions, including appetite, sleep-wake balance 
and sexual behaviour. These symptoms are individually variable during 
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Figure 2 | Controlling projections in the intact mammalian brain. 
Behavioural state assembly and complexity can be assessed with projection 
targeting. Functional organization of the bed nucleus of the stria terminalis 
(BNST) circuitry is shown. The oval nucleus of the BNST (ovBNST) is 
anxiogenic, and the anterodorsal BNST (adBNST) is anxiolytic®, as is the 
projection from the BLA (red arrows); the ovBNST may act to increase anxiety 
by inhibiting the adBNST or by independently influencing downstream 
structures. The adBNST projects to the ventral tegmental area (VTA), lateral 
hypothalamus (LH) and parabrachial nucleus (PB); each of these projections 
decreases a distinct feature of anxiety expression, and coordinated recruitment 
of these subpopulations may be implemented by recurrent circuitry in 
adBNST™. Projections are assessed by introduction of opsin (needle) into the 
adBNST, and independent targeting of fibre-optic interfaces depicted for each 
candidate downstream target (blue arrows). Here (as with medial prefrontal 
cortex projections and VTA projections) little mixed-site collateralization from 
single neurons is seen®. Adapted with permission from ref. 68. 


depressive episodes, but in aggregate are common, debilitating and 
constitute core criteria for diagnosis of major depression. The initial 
optogenetic control of behaviour in mammals was in fact conducted 
on sleep-wake behaviour through the hypocretin neurons of the lateral 
hypothalamus; certain patterns of activity delivered to these cells in 
mice were found to favour the transition from sleep to wake”. It was 
later found that driving noradrenaline neurons in the locus coeruleus 
(LC) exerted a rapid and profound effect on awakening*”™, and by driv- 
ing hypocretin neurons with an excitatory opsin while simultaneously 
inhibiting LC noradrenaline neurons with an inhibitory opsin, it was 
observed that the effect of hypocretin neurons on awakening””*** could 
be accounted for by projections to the LC. Early applications of optoge- 
netics to another vegetative category, feeding behaviour, have included 
defining (and modulating) the reward value of feeding (an important 
step towards understanding how the reward value of feeding could 
become altered in the setting of psychiatric disease). Precisely tuned 
control over the value of nutrient consumption was achieved using 
optogenetic drive of DA neurons in the VTA, which shifted the value 
of one class of nutrient to be greater than the highly appetitive value of 
sucrose*’. Later papers used optogenetic projection targeting to evoke 
strikingly excessive feeding behaviour with relevance to depression, 
eating disorders and obesity by driving agouti-related peptide (AgRP) 
neurons that project from the arcuate nucleus to the paraventricular 
hypothalamus (PVH)” or by driving the GABAergic projection from 
the BNST to the lateral hypothalamus™. 
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Hedonic and anhedonic behaviours, beyond their relevance to feed- 
ing alone, are of substantial significance in psychiatric disease; anhe- 
donia is a formal diagnostic criterion for depression, distinct from 
vegetative and motivational criteria. At the other end of the spectrum, 
excessive hedonic behaviour manifests in conditions relating to sub- 
stance abuse and dependency, as well as in manic and hypomanic condi- 
tions. Dysfunction in responding to rewarding or appetitive stimuli can 
be studied in animals by determining to what extent behaviour can be 
positively conditioned by, or to what extent animals will work for, the 
stimuli in disease-like states”. Over the past five years, applications of 
optogenetic projection targeting under physiological conditions and in 
disease models have begun to illuminate the causal structural dynamics 
of hedonia and anhedonia. 

An initial step came with the optogenetic drive of VTA DA neurons, 
to causally test whether different patterns of spikes in these defined 
cells could be appetitive for mammals”. In freely moving mice, high- 
frequency ‘phasic’ bursts of spikes delivered to VTA DA neurons were 
found to potently drive place-preference conditioning, and, interest- 
ingly, seemed preferable to mice compared with the same number of 
spikes delivered at a lower ‘tonic’ frequency” (highlighting the impor- 
tance for optogenetics of eliciting precise temporal dynamics in the 
targeted cells). Although these initial experiments were carried out in 
Th-Cre transgenic mice to allow the targeting of VTA DA neurons with 
Cre-dependent opsin-expressing adeno-associated virus (AAV) vec- 
tors, in later work" rats were used as experimental subjects for robust 
assessment of the extent to which animals will work for a stimulus or 
reward. Th-Cre transgenic rats designed and created for this type of 
experiment were observed to execute many thousands of nose pokes 
per day (a test for experimental subject-initiated, motivated behaviour) 
to obtain optogenetically delivered phasic spike bursts in VTA DA neu- 
rons; furthermore, projection targeting revealed that rats will work for 
activity in VTA-DA cells that are even more precisely defined, namely 
the subset of these neurons projecting to the NAc”. It is worth noting 
that clinical DBS findings have pointed to the NAc in anhedonic symp- 
toms of major depression”. 

Optogenetic projection targeting has identified a number of additional 
afferents to the NAc that also modulate reward-related behaviour or 
appetitive conditioning, in some cases connected to psychiatric-disease- 
related symptoms. Not only will animals work to receive stimulation of 
the defined BLA projection to the NAc, but also optogenetic inhibition 
of this projection creates a depressive-like phenotype of reduced moti- 
vation to obtain a normally appetitive sucrose solution”. Subsequent 
work has implicated excitatory afferents to the NAc from the ventral 
hippocampus (vH) and mPFC in promoting appetitive conditioning 
and reward-related behaviour”; dopamine type-1 (D1) receptors in 
NAc were implicated, and the vH pathway to the NAc was found to be 
potentiated by cocaine intake”’, consistent with the earlier finding that 
D1 receptors in the NAc are important for appetitive cocaine responses™. 
Additional cocaine-related projection targeting studies have illuminated 
the relevance of PFC-NAc pathways; inhibition of the projection from 
the PFC to the NAc blocked cocaine-seeking behaviour in rats”, and 
specific inhibition of mPFC inputs to the NAc by optogenetic depotentia- 
tion elicited a reduction in cocaine-induced locomotor sensitization”. 
Together, these studies have elucidated fundamental aspects of causal 
structural dynamics that underlie abused-substance-related hedonic 
behaviours. 

Recently, much important information has also come to light con- 
cerning the causal role of the projections into the VTA itself, which 
is relevant to hedonic and anhedonic behavioural states. An opsin- 
carrying rabies virus was delivered into the VTA; projection targeting 
was then achieved by delivering light to distinct upstream cell-body 
regions”. Conditioned place preference was seen when the projection 
from the laterodorsal tegmentum (LDT) to the VTA was driven (inter- 
estingly these projections synapse preferentially onto NAc-projecting 
outgoing VTA DA neurons)”. By contrast, conditioned aversion was 
observed when the projection to the VTA was driven from the LHb” 
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— the same structure that supported a depressive-like passive-coping 
response in the FST when mPFC-to-LHb projections were driven”®. In 
acomplementary (and concordant) projection targeting set of results”, 
LHb projections to the VTA were optically driven by transducing the 
LHb with a channelrhodopsin and illuminating the VTA; this interven- 
tion was also aversive (and in fact antagonized positive reinforcement)”. 
Interestingly, the aversive LHb projections to the VTA seem to favour, 
as synaptic targets, DA neurons that project, not to the NAc, but instead 
to the mPFC”, perhaps outlining a passive—aversive loop of influence 
definable by optogenetic projection targeting, spanning the brain and 
schematized as mPFC-LHb-VTA-mPEC. There may be antagonism 
or competition for influence between appetitive and aversive projec- 
tion networks across the brain, as the aversive LHb also seems to drive 
GABAergic cells in the rostromedial tegmental nucleus (RMTg) that 
in turn inhibit putatively behaviourally activating or appetitive VTA 
neurons projecting to the NAc (lateral shell region)”. 

Other influences beyond that of mPFC converge onto the LHb; for 
example, in rats, projection targeting was used to drive the glutamatergic 
entopeduncular nucleus to LHb projection”, resulting in conditioned 
place aversion. The potential for complex regulation is substantial and, 
given the spatial overlap of the passive or aversive circuitry with behav- 
iourally activating or appetitive influences at nearly every step (nota- 
bly in the mPFC and VTA), no doubt both the anatomical precision 
of tract targeting and the type of behavioural challenge delivered will 
determine which competing pathway experimentally predominates. 
As an example of anatomical complexity, projections to the VTA can 
either promote or inhibit reward, depending on location and cell type 
of fibre origin in the anterodorsal or ventral BNST, respectively*'™”. 
Andas an example of behavioural history complexity, severe and acute 
social-defeat stressors seem to favour recruitment of an aversion-related 
DA cell population in VTA”, in contrast to chronic mild stressors that 
favour recruitment ofa behaviourally activating or appetitive DA VITA 
population”. Such complexity is expected in the mammalian brain; 
these experiments have provided only the first insights into the causal 
impact of defined spike patterns in specified cells and projections on 
mammalian hedonic behaviour. 


Outlook 

What is the significance of the observation that projection-specific 
dynamics operate naturally as control levers that are relevant to psychia- 
try? Careful interpretation is important here because a tested projection 
presumably does not subserve only the role implicated (Table 1), nor 
need all, or even most, of the fibres within the projecting tract contribute 
to the behavioural effects seen; of course, these considerations apply 
to any experimental intervention. The sign and direction of the net 
behavioural change captures only one resultant of the causal influence of 
the projection tested. Still, as already noted, in many cases a projection- 
specific dynamical modulation delivers more efficacious elicitation or 
correction of psychiatry-related adaptive or maladaptive behaviours 
compared with region-specific dynamical modulation (despite probably 
recruiting fewer targeted circuit elements). This pattern even extends 
beyond the psychiatric-disease-related domain; for example, recently, 
more precisely tuned behavioural changes were obtained through corti- 
costriatal optogenetic projection targeting compared with direct stimu- 
lation in a challenging auditory behaviour task®. 

One interpretation is that greater experimental efficacy may be 
obtained through the directional and intersectional specificity (Box 1) 
inherent to AAV opsin transduction integrated with spatially separated 
fibre-optic-based projection targeting, which minimizes the generation 
of conflicting signals (such as push-pull on the same behaviour, as was 
observed in the mPFC FST). In addition, projection targeting allows 
direct control to only be exerted over cells that have long-range projec- 
tions, without directly perturbing purely locally connected neurons that 
may be engaged in active circuit computations; although projecting cells 
can also be involved in computation and representation of information, 
these roles may be tied closely enough to the projection wiring itself that 
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delivery or inhibition of experimenter-provided spikes can be meaning- 
ful for modulating complex behaviours. But beyond such experimental 
considerations, the mammalian brain seems to be set up to allow natural 
behavioural state features to be tuned by projection dynamics (Fig. 2 
and Table 1), and many psychiatric symptoms (such as the aversive or 
negative quality of the anxious state, which is dysregulated to the point of 
morbidity in anxiety disorders) may be extreme manifestations of these 
natural features. As already described, multiple-feature natural states (for 
anxiety, these can include respiratory and behavioural changes) can be 
managed from a centralized node (for example, the BNST) that assem- 
bles these components through outgoing projections® (Fig. 2). These 
projections may set the gain for execution of each feature, so that turning 
up or down the activity of these defined projections accesses a natural 
anatomically defined circuit signal for feature (or symptom) intensity. 

This causal structural dynamics perspective contributes to our under- 
standing of natural behaviour, but may also have clinical insight implica- 
tions. Individual features of a behavioural state can be debilitating but 
not necessarily the overall state itself. For example, it may be appropriate 
to elevate both the respiratory rate and alertness to danger in a new 
environment, but only if the additional feature of subjective aversion is 
strongly recruited does a pathological state resembling clinical anxiety 
result. The clinical problem can be a feature of the state, not the state 
itself, and therefore identification of projection-specific dynamics that 
are relevant to disease has implications for our understanding of aver- 
sive, appetitive or maladaptive features that are more closely linked to 
clinically relevant morbidity than the overall behavioural state itself. 
Improved brain stimulation treatments®'” may eventually result from 
guiding focal interventions to brain locations where the identified 
symptom-related projection is most resolved from other projections; 
here integration of causal information derived from optogenetics may 
dovetail with patient-specific anatomical information from diffusion 
magnetic resonance imaging (dMRI)-based tract mapping and model- 
ling’. Investigation of the synaptic plasticity rules governing optoge- 
netically defined projections may also help us to elicit stable effects 
from brief interventions (which could be delivered through functional 
magnetic resonance imaging (fMRI)-guided trans-cranial magnetic 
stimulation (TMS)'**’®’). Molecular characterization of identified pro- 
jections with techniques” for high-content identification of expressed 
transcripts and proteins may facilitate the development of more spe- 
cific projection-informed and targeted drug therapies. And it will be 
interesting to explore whether separate efficacious white-matter DBS 
target locations for a given disorder'” actually share a common pro- 
jection component that might be identified by dMRI or CLARITY in 
human or animal settings” ". Moreover, animal-model identification 
of causally important projections may guide seed placement for clinical 
dMRI mapping, even as dMRI maps linked to clinical symptoms in turn 
feed back to inform experimental target selection in animals and guide 
dynamical modelling. 

In principle, modulation of connectivity properties can exert a highly 
sensitive and consequential influence (in certain models) over network 
dynamics and information flow”, and simulated lesioning of loca- 
tions (nodes or vertices in brain graphs) wired with different statistics 
(projection or edge properties) can give rise to different effects on network 
dynamics and connectivity'’. However, as is well-recognized by graph 
theorists in neuroscience'*”"*, it is not completely clear what biologi- 
cal scale these edges and vertices might most usefully represent, and in 
the course of generating hypotheses and valuable insights into network 
complexity and connectivity, current graph-theoretic approaches have 
considered spatial scales that range from cells to macroscopic subdivisions 
of the human brain"”®. Linking this field with causal projection-dynamics 
information arising from optogenetics may help to concretize the nature 
and scale of graph components and network measures, such as those 
describing node connectivity’ (for example betweenness centrality). The 
physical understanding provided by projection-dynamic studies could 
also help us to interpret the effects of other circuit phenomena on psychi- 
atric symptoms. For example, local E/I balance shifted by neurochemical 
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influences or medications could alter recurrent feedback within a brain 
region such as the mPFC or extended amygdala (Box 1), and, in doing 
so, determine the extent of coordinated recruitment of behavioural state 
features or symptoms governed by specific outgoing projections”. 

Although broad avenues and opportunities exist for leveraging insights 
from causal structural dynamics findings, there are also limitations. Cer- 
tainly, the challenge of projection complexity should be appreciated. Iso- 
lated tracts can control multiple behaviours, individual axons within a 
tract could differentially branch and affect multiple downstream regions 
in a symptom-relevant fashion, and individual axons could also deliver 
differentially complex and non-canonical combinations of neurotrans- 
mitters'”” that are potentially relevant to the diversity of symptoms seen 
in single discrete psychiatric diseases. From the perspective of psychiatric 
disease, it is also important to underscore that knowledge of causal pro- 
jection-dynamics phenomena will be largely limited to those accessible 
with optogenetics and therefore will tend to exclude symptom domains 
that lack widely accepted animal models”, extending to the delusions, 
hallucinations and disordered thinking of schizophrenia. These less- 
accessible symptom domains still probably involve dysfunctional pro- 
jections or wiring!” and are clinically known to cluster with other 
behavioural phenomena that are accessible in animal models (such as 
social withdrawal and stereotypies), which may therefore share related 
underlying circuit-dynamics manifestations; improved multidimensional 
and high-throughput behavioural quantification in the setting of simul- 
taneous physiological readouts will facilitate detection of such unifying 
themes. 

But even reliable and standard animal behavioural measures are sub- 
ject to controversy and differences in interpretation. One opportunity in 
causal structural dynamics research going forward may therefore be to 
improve disease-model validation and to refine standard animal behav- 
ioural measures, partly through integration of projection targeting behav- 
ioural research with clinical projection research (such as recent large-scale 
human connectome efforts using MRI, involving tractography data 
linked to symptoms). But in the end, no animal model will fully capture 
the ontogeny, pathophysiology and symptomatic complexity of human 
psychiatric disease. Therefore, it is useful to maintain attention not on 
diseases, but on restricted symptom domains. This approach is ideal for 
animal modelling, well-suited to the specificity of optogenetic projection 
targeting and well-aligned with the symptom-treatment approach of real- 
world clinical psychiatry. 

Psychiatric symptom domains, involving dynamical and distributed 
performance changes without frank regional cell loss, may in this regard 
share certain principles with developmental and evolutionary changes in 
neural circuitry. For example, over time certain environments and situ- 
ations may come to assume a different significance for an organism, and 
it will be important to adaptably recruit different features of physiology 
to create a behavioural state more optimized for this shifting significance 
— without requiring a fundamental redesign of the neural coding of 
behaviour or representation of the environment itself. This perspective 
suggests why certain classes of projections may be particularly plastic 
— or designed for variability — in a way that does not affect the neural 
code per se that corresponds to a motor plan or representation of sensory 
data, but rather the magnitude and valence of a behavioural state feature. 
This flexibility in projection functionality may in turn help to explain the 
prevalence of psychiatry-related structural dynamics in the mammalian 
population, representing hotspots for adaptive or maladaptive alteration 
in symptoms analogous to chromosomal hotspots for mutation relevant to 
cancer. Among other useful properties, projection dynamics may not only 
represent concrete physical manifestations of psychiatric symptoms, but 
may also help to contribute to modularity and flexibility of behavioural 
states during development and evolution. m= 
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Interneuron cell types 


are fit to function 


Adam Kepecs' & Gordon Fishell’ 


Understanding brain circuits begins with an appreciation of their component parts — the cells. Although GABAergic interneu- 
rons are aminority population within the brain, they are crucial for the control of inhibition. Determining the diversity of these 
interneurons has been a central goal of neurobiologists, but this amazing cell type has so far defied a generalized classification 
system. Interneuron complexity within the telencephalon could be simplified by viewing them as elaborations of amuch more 
finite group of developmentally specified cardinal classes that become further specialized as they mature. Our perspective 
emphasizes that the ultimate goal is to dispense with classification criteria and directly define interneuron types by function. 


in terms of morphology, connectivity and physiological properties’. 

Until 10 years ago, their classification, with a few notable excep- 
tions’, remained descriptive. Moreover, interneuron diversity was often 
treated either as a quasi continuum or a diversity space with cell types 
numbering potentially in the hundreds**. Studies from the past few 
years have coalesced into the surprising view that interneuron diversity 
may fundamentally be far more limited. When we consider their com- 
monalities at a genetic, circuit or functional level, an argument can be 
made for condensing large subclasses of interneurons into more finite 
groups. In this Review, we suggest that, on the basis of both develop- 
mental and functional criteria, interneuron diversity can be simplified 
and addressed experimentally. The differences in connectivity, gene 
expression and physiological properties of interneurons found across 
the brain seem enormous (Fig. 1), nevertheless, we argue that this com- 
plexity arises from a small number of non-overlapping cardinal classes. 
These represent developmental genetic ground states that can further 
specialize through their later interactions with other neurons. The 
ultimate goal of defining their identity through a set of computational 
principles remains daunting. However, with the advent of new tools that 
provide unprecedented targeting specificity, coupled with the means to 
manipulate the in vivo activity of targeted neural populations this goal 
is becoming attainable. 


I nterneurons, of all the cells within the forebrain, are the most diverse 


Birth and specification of interneurons 
How is neuronal diversity created? Developmental studies across vari- 
ous species”® and systems”* have suggested that cell diversity arises 
from specification programs established in progenitors that have been 
modified to varying extents by their subsequent post-mitotic interac- 
tions. The balance between genetic- and experience-dependent pro- 
cesses seems to represent a compromise dictated by the organizational 
constraints of the particular system. Within the cortex, pyramidal cells 
undergo a relatively orderly migration from the proliferative zone to 
the overlying cortical plate. As such, cell identities are largely controlled 
by programs established within progenitors’. By contrast, interneuron 
progenitors of the telencephalon undergo incredibly complex patterns 
of dispersion. At the extremes, this could either be due to exquisitely 
precise pre-programs for migration to particular structures or plasticity 
mechanisms that allow the progenitors to adapt to local environments. 
Until the late nineties, it was widely assumed that excitatory and 
inhibitory neurons within the cortex shared a common lineage. The 


seminal breakthrough came from the realization that interneurons 
originated within focal subcortical proliferative zones’. This first came 
to light with landmark papers showing that the GABAergic popula- 
tions from the ganglionic eminences migrated dorsally to populate the 
cortex’, as well as to all other structures within the telencephalon'’”. 
Subsequent work in the spinal cord, led to the conjecture that an under- 
standing of how specific subtypes are generated would come from a 
detailed analysis of gene expression within progenitors. It was assumed 
that combinatorial transcriptional codes in subpallial progentiors func- 
tioned to establish distinct cortical interneuron subtypes. 

The connection between developmental origins and interneuron 
diversity has steadily expanded over the past 20 years. Almost all 
GABAergic interneurons within the telencephalon arise from one of 
two embryonic subcortical progenitor zones, the medial ganglionic 
eminence (MGE) and caudal ganglionic eminence (CGE; Fig. 2). 
Moreover, those arising from each structure represent complementary 
interneuron subtypes™''"*. These major areas are augmented by special- 
ized subpopulations from the lateral ganglionic eminence’ and the pre- 
optic region®. It also became clear that there is a strong correspondence 
between interneuron classes and the specific progenitor zones that gives 
rise to them. Within the cortex, the MGE gives rise to the parvalbumin 
(PV)-expressing fast-spiking interneurons (including both basket and 
chandelier cells) and the somatostatin (SST)-expressing populations, 
of which the Martinotti cells form the largest subset’*'®"”. The CGE 
produces relatively rarer subtypes, including neurogliaform, bipolar 
and vasointestinal peptide (VIP)-expressing multipolar interneurons”. 

Genetic lineage analysis within the hippocampus reinforces the idea 
that specific interneurons arise from specific structures but demon- 
strates that a simple correspondence across forebrain regions is unten- 
able. For instance, although in the cortex neurogliaform neurons are 
CGE-derived, a large proportion of the corresponding population in 
the hippocampus arises from the MGE”™. Furthermore, whereas some 
classes, such as fast-spiking basket cells, show marked similarities across 
structures, others subclasses do not yet seem to have obvious paralogues. 
For instance, cholecystokinin basket cells, although a large population 
within the cortex and hippocampus, do not seem to be present within 
other brain areas”. Similarly, there seems to be at least two populations 
of the so called oriens/lacunosum-moleculare cells (named in accord- 
ance with the position of their cell body and dendrites’) that derive 
from distinct sources, one which expresses the ionotropic serotonin 
receptor 5HT3aR and one that does not. Adding further complexity, 
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Figure 1 | Multiple dimensions of interneuron diversity. Interneuron cell types are usually defined using a combination of criteria based on morphology, 
connectivity pattern, synaptic properties, marker expression and intrinsic firing properties. The highlighted connections define fast-spiking cortical basket cells. 


analysis of the basal ganglia suggests that only the MGE is a major source 
of interneuron populations within these structures”. 

These differences across areas raise two possibilities. First, there might 
be dedicated populations of interneuron progenitors that are commit- 
ted to populating specific brain structures. Second, the notion of refer- 
ring to an interneuron’s origin as deriving from a specific embryonic 
structure may be an imprecise proxy for gene expression. For instance, 
even though hippocampal neurogliaform cells arise from both the MGE 
and CGE, a common constellation of specification genes may be acting 
within both embryonic regions. Similarly, the differential expression 
of functional determinants such as serotonin receptors, in otherwise 
similar interneuron subtypes, are unlikely to represent distinct cardinal 
classes. Rather they probably represent iterations produced by cardinal 
cousins or differential post-mitotic interactions by members of a single 
cardinal class. 

These details emphasize the importance of mapping interneuron 
diversity onto molecular mechanisms. GABAergic lineages can be 
divided into those with long-range projections, such as those in the 
striatum or globus pallidus, and interneuron populations that largely 
project locally. A number of factors seem to be used within all GABAe- 
rgic neurons (Fig. 2), most notably the transcription factors encoded 
by Dix1 and Dlx2, Ascl2, and Gsx1 and Gsx2 that themselves form a 
regulatory network”. In the pallium (the region of the forebrain that 
will give rise to cortical structures), a similar cohort of transcription fac- 
tors, including those encoded by Emx1, Neurog1 and Neurog2, and Pax6, 
function analogously in the specification of the excitatory populations”. 

DIx1 and D/x2, in particular, function at multiple stages of GABAergic 
maturation: in the acquisition of GABAergic identity”, the initiation 
and cessation of tangential migration*”*”, and in the morphological 
and physiological maturation of specific subclasses”. The specific role 
of Dix1 and DIx2 in these disparate developmental activities has become 


clearer as their transcription targets have been identified. These include 
Elmo1, Dix5 and Dlx6, Arx and Gemin2 (or Zep2), each of which has 
been shown to be required in the control of migration and regional 
identity” *. Moreover, mutations of these genes, presumably through 
their requirement for interneuron function, contribute to a variety of 
affective psychiatric disorders”. 

In addition, a number of factors seem to be more restricted to spe- 
cific subtypes. Although far from complete, a genetic hierarchy for the 
MGE-derived PV and SST lineages has begun to emerge. Within the 
MGE, the cascade begins with Nkx2-1, which acts as master regulator 
in promoting MGE-derived interneuron fates over CGE-derived cell 
types****. Moreover, in the clearest example of a single gene contrib- 
uting to the generation of a specific interneuron subtype, chandelier 
or axo-axonic interneurons have been shown to arise relatively late 
in embryonic development (embryonic day 15 to 18 in mice) froma 
population of Nkx2-1 progenitors**””. In addition, Nkx2-1 is a gene with 
both activator and repressor function. Its repressor function attenuates 
the expression of CGE-specific genes, whereas its activator function 
induces the expression of Lhxé6 (ref. 17), which is needed to promote 
the differentiation of both PV- and SST-expressing interneurons. Lhx6 
in turn drives the expression of a series of factors including Sox6 and 
Satb1 (refs 38, 39) the products of which selectively affect the develop- 
ment of both PV- and SST-expressing interneurons. By contrast, so far 
only a few genes have proven to be specific to the CGE-derived line- 
ages. Collectively, the specific functions of these transcription factors 
and their targets are still a work in progress, but the tools to tackle this 
challenge are at hand. 

Although the emerging picture is exhilarating, rather than coalescing 
into an explanation for the myriad distinct subtypes that populate all 
areas of the forebrain (Fig. 1), it seems to only reveal a handful of genetic 
cascades. If the goal were simply to account for the neuronal markers 
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Figure 2 | Interneuron subtypes are generated from discrete proliferative 
regions within the subpallium. The progressive development of the 
telencephalon from an undifferentiated epithelium into discrete proliferative 
zones that produce particular interneuron populations and the specific genes 
involved at each stage. On the right are more anatomically accurate cross- 
sections of the progenitor zones from embryonic day 8 to 11, and a schematic 
of interneuron diversity in the cortex. Interestingly, although common 


that have conventionally been used to categorize interneurons, then a 
mere six (PV, SST, VIP, nitric oxide synthase, reelin and calretinin) could 
divide most interneurons within the forebrain. But clearly such a classi- 
fication would belie the regional complexity of interneurons. Within the 
hippocampus alone there are easily four or five SST-expressing cell types 
and at least three PV-expressing populations. Similar distinct subpopu- 
lations of SST- and PV-expressing populations have been discovered 
in the cortex, and more will probably be found. Although we believe 
that the cardinal specification of interneurons is only the first, albeit 
crucial, step in the progressive specification of subpallial progenitors, 
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proliferative zones produce the entire diversity of interneurons across all 
telencephalic structures, unique cell types and gene expression are seen in 
interneuron populations that reside in particular telencephalic structures. 
CCK, cholecystokinin; CR, calretinin; CGE, caudal ganglionic eminence; LGE, 
lateral ganglionic eminence; LTS, low threshold spike; MGE, medial ganglionic 
eminence; NGE nerve growth factor; NOS, nitric oxide synthase; OLM, oriens/ 
lacunosum moleculare; POA, pre-optic area; VIP, vasointestinal peptide. 


can our cardinal identity hypothesis account for this increasing wealth 
of interneuron subtypes? It is certainly possible that we have grossly 
underestimated the cardinal subtypes. Complex maps showing intri- 
cate embryonic patterns of gene expression within the subpallium have 
been posited to combinatorially specify different cell types”. However, 
we think that the cell types generated by developmental programs are 
unlikely to explain all of the regional diversity observed. First, the loss of 
specific genes results in phenotypes that are invariably not restricted to 
specific interneuron subtypes. Second, the loss of specific genes affects 
the generation of interneurons across various structures, arguing against 
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the existence of progenitor populations dedicated to the generation- 
specific interneuron classes. That said, it is possible to imagine a com- 
binatorial gene regulation strategy in which an individual gene could 
be necessary for a variety of disparate differentiation programs. Hence, 
another way to explore the question of whether regional diversity is 
established in progenitors is through lineage analysis. 


Interneuron lineages 

Are the interneuron populations that populate particular structures, 
such as the hippocampus or cortex, derived from dedicated progeni- 
tor pools? Two recent studies have directly explored the role of line- 
age in the development of cortical interneurons”. Both have shown 
that clonally related progenitors seem to be preferentially relegated to 
specific cortical columns or layers, supporting the idea that progeni- 
tor lineages are dedicated to producing lineages destined for particular 
brain structures. Given the long and convoluted paths taken by these 
progenitors as they transit to their mature position, it was stunning 
that clones could collectively target particular parts of the cortex. Inter- 
estingly, such clones were equally likely to be comprised of mixed SST- 
and PV-expressing interneurons rather than one subtype. Perhaps this 
lack of tendency for clones to ‘breed true’ should not come as a surprise. 
A wealth of lineage analysis in invertebrates”, as well as in the vertebrate 
retina’! and spinal cord”, indicates that neuronal lineages, although ste- 
reotyped, do not generally produce cells of a single subtype. Moreover, 
it will be interesting to explore whether in addition to being clustered, 
lineally related cells are also dispersed and, if so, to what extent®”*, If 
they are dispersed, it will be intriguing to assess the afferent and efferent 
connectivity of such clones, as this would help address the question: to 
what extent do intrinsic compared with local cues direct the connectiv- 
ity of interneurons? 


How circuits nurture interneuron subtypes 

The accumulated evidence supports a strong role for developmentally 
regulated genetic programs in the allocation of interneurons to broad 
cardinal classes. What it does not seem to explain is how interneurons 
from the same cardinal class are able to form connections with such a 
wide variety of synaptic partners. In favour ofa role for local cues con- 
tributing to this process, various studies have suggested that both excita- 
tory and inhibitory signals may influence the migration and positioning 
of developing interneurons”, Recent data have shown that attenuating 
the activity of specific interneuron populations affects both their migra- 
tion and morphological development™. Although the activity is class 
specific, whether it is instructive rather than simply permissive has yet 
to be demonstrated. That said, there seems to be growing support for 
the notion that local signals may direct the region-specific differen - 
tiation of interneurons. In support of this, a number of genes are spe- 
cific to interneurons within the cortex but not the striatum, including 


Long-range 
input 


Local input /\ 


Neuromodulatory 
input 


Figure 3 | Two faces of interneuron function. A cortical circuit from 
the perspective of a vasointestinal peptide (VIP) interneuron. The 
recruitment of VIP interneurons is constrained by the inputs they 
receive. The afferents can be local, long range from other cortical areas 
as well as neuromodulatory from the dorsal raphe and nucleus basalis 


REVIEW 


Zep2, DIx1, Elmo1 and Mef2c, the last three of which are activity regu- 
lated**”, Although it may be that activity simply promotes the matu- 
ration of interneuron populations that are already pre-specified, it is also 
possible that activity directs region-specific differentiation. Extensive 
work has indicated that voltage-gated calcium influx may result in de 
novo gene expression (reviewed in ref. 53). It is the failure at present to 
identify genes within proliferative populations that are indicative of 
region-specific differentiation that has led us to propose a two-phase 
model of interneuron specification. We envision that activity-regulated 
gene expression during critical periods may be responsible, during the 
second phase, for the allocation of cardinal classes into specific sub- 
classes. Recent work has shown that MGE-derived cells can produc- 
tively integrate into both normal and abnormal neuronal circuits”. 
This supports the idea that local cues can direct nascent interneurons 
to form appropriate connectivity with various synaptic partners. How 
interneurons form functional circuits in a variety of structures is a cru- 
cial question that remains to be answered. 

So far we have taken a bottom-up developmental view that is aimed 
at examining events by which interneuron subtypes are integrated into 
functional circuits. In the next section, we take a top-down view and 
examine circuit-specific functions of interneuron types. A prediction 
of our model is that the developmental genetic programs functioning 
in interneuron progenitors lead to the production of a relatively small 
number of cardinal subclasses. We believe that the much larger diversity 
of interneurons observed in mature brain circuits reflects later refine- 
ments imposed locally on specific subclasses. If this were true, it would 
predict that interneurons from the same cardinal class would, within 
the same circuit, be exposed to similar cues and hence develop similar 
functional properties. Although the data so far are in nascent stages, 
the availability of genetic driver lines*””* to reliably target particular 
interneuron cohorts, has provided the means to test this hypothesis. 


How interneurons function 

What is a meaningful measure of the function of an interneuron sub- 
type? Because interneurons generally project locally, their firing needs 
to be understood in the context of the circuits to which they contribute. 
There are two complementary ways of approaching the question of what 
an interneuron does: by examining when and how they are recruited to 
fire or by determining the impact of their firing on the circuit (Fig. 3). 
First, understanding recruitment requires us to determine under what 
circuit and brain-state configurations or behavioural contingencies is a 
given neuron active? This is strongly constrained by a neurons afferent 
connectivity, which in turn is probably dictated by its developmental 
genetic program. However, interneurons are not hardwired. A set of 
afferents that drive interneurons to fire in one context may fail to do so 
in another. Clearly, to understand their recruitment we must take into 
account a large number of factors. The classic idea is that interneurons 


Circuit impact 


through ionotropic receptors. The circuit impact of VIP interneurons is 
constrained by their outputs. The efferents are mostly to somatostatin 
(SST) interneurons and to a smaller degree to parvalbumin (PV) 
interneurons, which lead to the disinhibition of a functional subset of 
principal cells*'". 
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coordinate networks, such that their recruitment is best understood 
in reference to the local population activity (Fig. 4a). We also consider 
an alternative view that we term the flow-control hypothesis, whereby 
interneurons gate information flow within a given circuit and are excited 
at precise moments in reference to specific behavioural events (Fig. 4b). 
These two ideas are by no means mutually exclusive, as successful flow 
control must depend on the signals being suitably coordinated. Second, 
we can ask about circuit impact. How does the firing of an interneuron 
influence the activity of neurons in its local circuit? This aspect of func- 
tion is strongly constrained by the efferent connectivity of a neuron, 
which is thought to be dependent on its developmental genetic program. 
In addition, the impact ofinterneuron type will also greatly depend on 
whether their recruitment is coordinated with other neurons of the 
same cohort. 

Historically, models of cortical function have focused on circuit 
motifs, repeated patterns of connectivity, to infer computational func- 
tion for specific cell types”. Perhaps because inhibitory interneurons 
are largely local, they have generally been considered to simply guard 
excitatory networks against runaway excitation™. Recently, our under- 
standing of their function has become significantly more sophisti- 
cated. Among other lessons, we learned that interneurons could 
normalize the activity of local excitatory networks as well as provide 
feedforward inhibition. The latter strongly influences the timing of 
signals and allows excitatory signals to remain sub-threshold while 
carrying information. Of course, these are just two motifs out of a 
vast range of possibilities, including cross-coupling that can lead to 
synchronization, lateral inhibition that can segregate principal neuron 
populations, and disinhibition that can generate elevated activity. It 
has become clear that there are at least as many inhibitory circuit 
motifs as there are cell types. 


Computing with interneurons 

How then does the diversity of interneurons contribute to neural com- 
putations? First, it is worth noting that it would be difficult to imagine 
networks with only excitation. In fact, from an engineering standpoint, 
such networks would have to have extremely time-limited dynamics 
or they would become intrinsically unstable. Moreover, inhibition not 
only provides balance, it also ensures richness in the possible dynamics 
within networks of principal neurons. These considerations led to the 
idea that interneuron diversity allows for a vast increase in the compu- 
tational power of cortical circuits*’”. Broadly speaking, the computa- 
tional functions of interneurons can be grouped into either arithmetic 
or timing. 
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Figure 4 | Coordination and flow control hypotheses of recruitment. 
a, Coordination hypothesis. The bottom trace shows a local field potential 
representing the network state in the hippocampus. The firing of different 
neuron types (chandelier cell, light blue; basket cell, red; OLM cell, blue; 
pyramidal cell, brown) can be described in reference to the local field 
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A long-held idea is that different interneurons perform essentially 
arithmetic operations, such as subtraction or division®™. Inhibition can 
provide gain control by changing the input-output relationship between 
the excitatory drive and the resulting firing rate in principal cells: either 
by decreasing its slope divisively or by a subtractive shift. In turn, these 
elementary operations are the building blocks for cortical computations 
suchas normalization, an operation that provides divisive gain in propor- 
tion to the summed activity in a circuit™®*. Such gain modulation might 
result from shunting”, synchronous™ or balanced” inhibition. Originally 
proposed to explain early visual cortical responses, normalization has 
become one of the most studied cortical computations’. 

Another line of theoretical investigation has focused on the role of 
interneurons in controlling the timing of neural activity. More com- 
plex network functions require that neurons do not fire together. This 
can be achieved by dynamically balancing excitation with inhibition so 
that the resulting network activity becomes temporally irregular and 
asynchronous. These balanced networks thus provide rich dynamics 
and rapid responses”'. Indeed, cortical recordings often reveal finely 
balanced excitation and inhibition”, consistent with these models. 
When inhibition precisely tracks excitation, it can also increase tem- 
poral precision” or decorrelate networks”. 


Circuit impact of identified interneuron types 

To understand how computations are implemented in neural networks 
requires an appreciation of how distinct interneuron subtypes affect 
local networks. Although conventionally studied in vitro, we focus 
mostly on recent in vivo work using transgenic mice for targeting 
interneurons on the basis of markers, such as PV, SST and VIP*””*. It 
should be noted that these Cre-driver mouse lines neither demarcate 
entirely homogeneous interneuron populations nor map precisely onto 
cardinal interneuron types. Nevertheless, they provide a convenient 
and powerful tool for parsing interneuron heterogeneity because these 
three major markers delineate distinct non-overlapping populations 
and in aggregate can label around 85% ofall cortical interneurons”. 
When combined with optogenetic modulators*””, they have allowed 
researchers, for the first time, to test many long-held theories about 
the roles of inhibition. 

PV-expressing interneurons (either soma-targeting basket cells or 
chandelier cells targeting the axon initial segment) are strategically posi- 
tioned to control spiking, and are also strongly interconnected, which 
promotes their synchronous activity” *°. Recent studies have shown that 
this population controls the timing of spikes with respect to theta oscil- 
lations in the hippocampus*’’. In the visual cortex, optogenetic control 
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potential, both in terms of overall activity level and phase relationship*?’""”"". 


b, Flow control hypothesis. The bottom arrows mark the timing of four 
behavioural events: entry, exit, cue and reward. The firing of different neuron 
types (vasointestinal peptide, green; parvalbumin, red; somatostatin, blue; 
pyramidal cell brown) can be described in reference to these events*"”. 
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of PV-expressing interneurons can bidirectionally modulate the gain 
of visual responses”. Under some conditions, optogenetic activation 
of this population can even sharpen the tuning of cortical responses”. 

SST-expressing cortical Martinotti neurons are dendritic-targeting 
interneurons that project to layer 1 of the cerebral cortex and provide 
inhibition to the tufts of deep-layer pyramidal cells. Inhibition by Mar- 
tinotti cells strongly suppresses dendritic calcium spikes and burst- 
ing”, and can mediate di-synaptic inhibition between neighbouring 
pyramidal cells”*”*. Similarly, in the hippocampus, dendritic inhibition 
by SST neurons controls burst firing*’”**”*. In the cortex, SST-expressing 
interneurons have broader spatial tuning and can mediate surround 
suppression of visual responses”””’. However, SST-expressing neurons 
are anatomically diverse, with some subtypes specializing in disinhibi- 
tion of local principal cells”. 

VIP demarcates the third major class of interneurons, which con- 
stitute around 15% of all interneurons’*” and are mostly located in 
the superficial layers of cortex. VIP-expressing interneurons have 
long been proposed to mediate disinhibition””'”’. Recent studies have 
shown that in four different cortical regions, VIP interneurons tend 
to inhibit most SST and a smaller fraction of PV-expressing interneu- 
rons*’*»'” Inhibition of these interneurons in turn disinhibits prin- 
ciple cells in vivo, providing a form of gain control’. These results 
demonstrate that VIP-expressing neurons form a disinhibitory micro- 
circuit that is conserved across cortical regions with shared computa- 
tional functions (Fig. 3). 

A final example is provided by layer 1 neurons, which are almost all 
inhibitory interneurons”. Recent results have shown that two major 
interneuron types have opposing functions: neurogliaform cells inhibit 
layer 2/3 pyramids, whereas single bouquet cells inhibit other interneu- 
rons within layer 2/3, and may provide disinhibition in vivo". 

These studies injected great excitement into the field as they pro- 
vided, at last, the causal testing of hypotheses using optogenetics. 
In addition, they provided support for the hypothesis that cardinal 
classes of interneurons have defined circuit functions. However, they 
carry the caveat that they assume that these populations are synchro- 
nously activated under physiological conditions. Although often this 
is a reasonable assumption, it needs to be tested on a case-by-case 
basis by recording the requisite neural population during behaviour. 


Coordination and flow control during recruitment 

What are the brain-state and behaviour-dependent contingencies that 
determine when a specific interneuron type is activated? At the broadest 
scale, different behavioural modes are associated with large changes in 
global brain activity, therefore it is not surprising that different classes of 
inhibitory interneurons are activated in a highly state-dependent man- 
ner’ '®. Ata more refined scale, what is the simplest description of the 
conditions under which a given interneuron is activated? Is recruitment 
ofan interneuron subtype best understood with reference to a network 
state or a behavioural contingency? 


Network recruitment of interneurons 

One idea is that interneurons coordinate the precise timing of principal 
cell activation, such as network oscillations. There is a long history of 
experimental and theoretical investigations proposing that the diversity 
of interneuronal subtypes underlies a division of labour for organizing 
cortical population activity at different time scales*’'?'”. 

The best-studied examples of the contribution of different interneu- 
rons come from the hippocampus, in which recordings from targeted 
cells have been used to correlate their firing to network oscilla- 
tions'’”''*""®. These studies, mostly in anaesthetized animals, revealed 
that distinct interneuron subtypes fire during different rhythms (for 
example, theta, gamma and ripple) and with distinct phase relation- 
ships, suggesting that they differentially contribute to network dynam- 
ics. Thus it seems that the spike timing of different interneuron types 
can be referenced to specific network events?°”""” (Fig. 4a). More 
generally, this work supports the coordination hypothesis, whereby 
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each interneuron subtype performs as a temporal specialist within a 
‘distributed clock system’ that coordinates pyramidal cell ensembles’. 

These observations set the stage for testing the causal role of different 
interneuron subtypes in generating oscillations”. Recently, two stud- 
ies probed the role of PV interneurons using either optogenetic activa- 
tion''* or suppression'””, and found increased and decreased gamma 
oscillations, respectively, suggesting that interneurons are indeed 


actively involved in their generation. 


Behavioural recruitment of interneurons 

Analternative view is that the function of some interneuron types is better 
described with reference to behavioural events. Although interneuronal 
identity has long been inferred in behaving animals on the basis of spike 
waveform and firing pattern’), it is only recently that this could be 
directly ascertained for a handful of genetically defined interneurons dur- 
ing well-controlled behaviours. For instance, a recent study in the anterior 
cingulate cortex of mice reported the surprising observation that deep- 
layer PV-expressing and narrow-spiking SST-expressing interneurons 
responded in a functionally homogeneous manner at specific behavioural 
epochs™ (Fig. 4b). Similar observations were made in rat motor cortex 
using juxtacellular labelling in head-fixed rats: pyramidal cells responded 
heterogeneously, whereas all PV interneurons responded similarly at the 
moment of movement initiation’. This shows that PV interneurons, 
at least in mouse frontal regions, can be thought of as a functional unit. 
How could such functional homogeneity be achieved? One possibility is 
that inhibitory interneurons strongly sample local principal cell activity 
and their activation reflects a summary of local activity”. Therefore PV 
interneurons might fire in a behaviour- and region-dependent manner, 
which may be the ‘leaving decision in the anterior cingulate cortex, owing 
to its role in foraging behaviours’, but movement initiation in the motor 
cortex. 

Other examples of behaviourally activated responses come from 
the auditory cortex, in which a large fraction of interneurons in layer 
1 are activated by negative reinforcers during auditory fear condition- 
ing’. Similarly, VIP interneurons were observed to be strongly and 
uniformly recruited by negative- (air puff or mild shock) or positive- 
(water reward) reinforcement during an auditory discrimination task'” 
(Fig. 4b). Although such reinforcement feedback-related signals may 
at first seem surprising in a primary sensory area, as VIP interneurons 
mediate disinhibition (discussed earlier) they are an ideal conduit for 
gating signals” (Fig. 3). 

As we stated earlier, because interneurons are embedded in a highly 
interconnected network, their functions need to be understood in the 
context of local networks. In light of this, the observations that some 
interneuron types are recruited at specific behavioural events may seem 
puzzling. Indeed, consistent with the coordination hypothesis' 12117" one 
would expect that most responses would be constrained by the state of 
the local network, ona time scale of milliseconds, and not by behavioural 
contingencies. Mechanistically, the observed behaviourally specific acti- 
vation may reflect local network activity, which itself is tied to specific 
behavioural contingencies. We suspect that this explains the homoge- 
neous activation of deep-layer PV neurons. Nevertheless, their func- 
tion is most parsimoniously described by temporal reference to specific 
behavioural events. Alternatively, some classes of interneurons may be 
activated by strong long-range inputs. For instance, neuromodulatory 
systems can provide behaviour-dependent inputs to specific interneu- 
ron classes. Interestingly, VIP neurons have ionotropic receptors for the 
neuromodulators acetylcholine and serotonin, which probably drive 
reinforcement signals in these neurons”. 

At present, the behavioural recruitment of many interneuron types 
remains unexplored and different mechanisms may apply to each type. 
In contrast to the coordination hypothesis, some early results support 
the flow-control hypothesis (Fig. 4), proposing that distinct interneuron 
types specialize in controlling information flow in and out of local cir- 
cuits during specific behavioural contingencies, and thus acting much 
like controllers of a state machine. This suggests that the recruitment 
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of an interneuron type is linked with behavioural scale requirements 
of local circuits. Moreover, the observations that genetically defined 
interneuron classes show similar recruitment suggests they do act as 
functional units, supporting the existence ofa small number of cardinal 
interneuron types. 


Outlook 

Weare in the midst of an exciting era in which each week new data on the 
development and function of interneurons are being brought to light. In 
this Review, we suggest that the large diversity in interneuron classes may 
originate from a handful of cardinal cell types. Such an assertion could 
be misinterpreted as a statement claiming that interneuron classes are 
not diverse or that divisions into further subtypes are not warranted. The 
incredible work in areas such as the hippocampus shows us that this is 
patently incorrect. We have provided a framework that we hope will help 
to direct future studies by consolidating interneuron diversity into cardi- 
nal classes with specific ground states. Therefore if one wishes to explore 
questions regarding intrinsic physiological properties, axonal targeting 
or general target selection, understanding the ground state established 
in progenitors is a good place to start. However, to explore the circuit 
properties, connectivity or computational contributions ofa subclass of 
interneurons, one needs to consider the interplay between cardinal cells 
and the local cues received from the circuits that they contribute to. From 
a functional point of view, if we confine ourselves to specific circuits, 
such as VIP interneurons, a cardinal class will share important aspects of 
their function. Hence the tools to genetically target cardinal classes will 
prove invaluable for parsing the function of the different and more exotic 
interneuron subtypes they ultimately give rise to. 

It is intriguing to contemplate why such a mechanism is used to create 
cellular diversity. It may be that the strategy used by natural selection 
favours simple programs to provide stability, and combinatorial assem- 
bly to provide complexity. Although they are ultimately incorporated 
in a wide breadth of circuits, cardinal interneuron classes share crucial 
combinations of features that enable their function. Genetic programs 
direct the receptors they express, the cell types and subcellular compart- 
ment they innervate, as well as their firing properties. These features in 
turn strongly constrain interneuron recruitment and circuit impact. In 
short, we may ultimately find that interneurons exist as cardinal classes 
because nature has conspired to bestow on them generalized computa- 
tional function that necessitated the presence of common biophysical and 
hodological properties. Despite these commonalities, it is self-evident 
that neural circuits allow for a remarkable array of behavioural outcomes. 
Harnessing biology’s ability to use a limited set of building blocks, to cre- 
ate enormous diversity circuits holds the real promise that we may soon 
begin to understand the means by which brain circuits self-assemble and 
initiate function. m 
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The bone marrow niche for 
haematopoietic stem cells 


Sean J. Morrison! & David T. Scadden? 


Niches are local tissue microenvironments that maintain and regulate stem cells. Haematopoiesis provides a model for 
understanding mammalian stem cells and their niches, but the haematopoietic stem cell (HSC) niche remains incom- 
pletely defined and beset by competing models. Recent progress has been made in elucidating the location and cellular 
components of the HSC niche in the bone marrow. The niche is perivascular, created partly by mesenchymal stromal cells 
and endothelial cells and often, but not always, located near trabecular bone. Outstanding questions concern the cellular 
complexity of the niche, the role of the endosteum and functional heterogeneity among perivascular microenvironments. 


sues throughout development, beginning in the aorta~gonad- 

mesonephros (AGM) region and the yolk sac, followed by the 
placenta, fetal liver, spleen and bone marrow!. Postnatally, the bone mar- 
row is the primary site of HSC maintenance and haematopoiesis, but in 
response to haematopoietic stress the niche can shift to extramedullary 
sites. Defining niche components and how they work in concert to regu- 
late haematopoiesis provides the opportunity to improve regeneration fol- 
lowing injury or HSC transplantation and to understand how disordered 
niche function could contribute to disease. In this Review, we focus on the 
nature of the HSC niche in bone marrow because it is the subject of most 
of the recent research and controversies. 


| ] aematopoietic stem cell (HSC) niches are present in diverse tis- 


Historical context 

Following Darwins contributions to evolutionary theory, there was much 
emphasis on defining hierarchical evolutionary relationships among 
organisms. Morphological similarities were used to construct ancestral 
trees that connected complex multicellular organisms to an original 
monocellular “stem celle”. Lineage relationships were formulated, and 
biologist Ernst Haeckel proposed that cell organization in a developing 
organism was the recapitulation of events in the evolution of the species, 
with cells deriving from a stem cell equivalent’. Thirty years later, hae- 
matologist Artur Pappenheim proposed a less grand and more accurate 
formulation based on improved cell-morphology visualization techniques 
— that cells of the blood were related to one another, with mature cell 
types descending from a single cell type in a “unified view of haemat- 
opoiesis”*. In so doing, he articulated the hypothesis of tissue stem cells. 
This concept took about half a century to define experimentally through 
the inspired work of James Till and Ernest McCulloch, who showed that 
single cells could yield multilineage descendants while preserving the 
multipotency of the mother cell*”. The researchers gave substance to the 
idea of a stem cell and gave us methods to define the cardinal properties 
of those cells — self-renewal and differentiation. 

Till and McCulloch based much of their work on an in vivo spleen 
colony-forming (CFU-S) assay now known to measure mainly multi- 
potent progenitors rather than long-term self-renewing HSCs*”. The 
imprecise nature of that assay contributed to the formulation of the niche 
hypothesis by Ray Schofield in 1978. Recognizing that the putative CFU-S 
stem cells were less robust than cells of the bone marrow at reconstitut- 
ing haematopoiesis in irradiated animals, he proposed that a specialized 


bone marrow niche preserved the reconstituting ability of stem cells’®. 
His colleagues at the University of Manchester concurrently sought to 
define what made bone marrow a nurturing context for HSCs, and hae- 
matologist Michael Dexter showed that largely mesenchymal ‘stromal’ 
cell cultures could maintain primitive haematopoietic cells ex vivo". Fur- 
thermore, another colleague, Brian Lord, progressively reamed long bone 
marrow cavities and showed that primitive cells tended to localize towards 
the endosteal margins, leading to the hypothesis that bone might regulate 
haematopoiesis* (Fig. 1). 

These early studies were followed by in vitro evidence that osteoblasts 
differentiated in culture from human bone marrow stromal cells could 
produce haematopoietic cytokines and support primitive haematopoietic 
cells in culture’’. This fostered the idea that bone cells might create the 
HSC niche, but it was essential to move to engineered mouse strains to test 
the hypothesis in vivo. Two studies followed, including a mouse model in 
which a promoter that was restricted in activity to osteoblastic cells was 
used to drive expression of a constitutively active parathyroid hormone 
receptor”. Along similar lines, Linheng Li’ laboratory used a promoter 
that has since been shown to be restricted in bone marrow stroma to 
primitive and mature osteolineage cells’*, to delete the BMPr1a gene”. 
In both models, the number of endosteal osteoblasts and the number of 
primitive haematopoietic cells (scored as stem cells given the measures in 
use at the time) increased. These data provided the first evidence of spe- 
cific heterologous cells regulating mammalian stem cells in vivo, although 
it remained unclear whether the regulation was direct or indirect. This 
demonstrated that the niche was experimentally tractable, prompting a 
series of studies that have since refined our understanding of the complex- 
ity of the bone marrow microenvironment. 

Studies of the niche have now more precisely determined the com- 
ponents that regulate HSCs, and to some extent other haematopoietic 
progenitors, in the bone marrow. Like any interactive system there are 
complex regulatory relationships among cells in the bone marrow. A 
perturbation in one cell type that leads to an effect in another cell type 
does not necessarily require the interaction between the cells to be direct. 
The data now suggest that the early studies that observed effects on HSC 
frequency as a consequence of genetic manipulation in osteoblastic cells 
reflected indirect effects rather than the existence of an osteoblastic niche. 
Indeed, expression of constitutively active parathyroid hormone receptors 
in osteoblasts"* probably causes widespread changes in many cell types 
of the bone marrow, including in the vasculature. Current data suggest 
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there are specialized niches for distinct types of haematopoietic stem and 
progenitor cells, and that each niche may be created by multiple cell types 
that contribute to the niches in unique as well as redundant ways ” Indeed, 
there is heterogeneity among HSCs themselves’, raising the possibil- 
ity of cellularly distinct niches for distinct subpopulations of HSCs. This 
Review focuses on the current data and the unanswered questions. 


Mapping the bone marrow space 

Aniche is defined by anatomy and function” — a local tissue microenvi- 
ronment that directly maintains and regulates a particular kind of stem 
cell or progenitor”. Determining what cells neighbour HSCs and regulate 
HSC maintenance has been complicated by the difficulty in retaining 
histological integrity when sectioning bone, as well as the complexity of 
immunostaining methods that are necessary to identify HSCs. 

The identification of markers that reliably identify HSCs in vivo was an 
important step in defining the niche. Despite having the ability to isolate 
HSCs by flow cytometry for decades”, identifying HSCs within tissues 
has remained a challenge because the combination ofimmunofluorescent 
markers used to isolate HSCs by flow cytometry was too complex for 
microscopy. Consequently, markers of poor specificity were often used. 
For example, putative HSCs have been localized in the bone marrow using 
retention labels 5-bromodeoxyuridine (BrdU) or GFP-labelled histone 
H2B (H2B-GFP) as markers™”*. Although there is a subset of HSCs that 
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Figure 1 | Bone marrow anatomy. Haematopoietic stem cells (HSCs) reside 
mainly within bone marrow during adulthood. Bone marrow is a complex 
organ, containing many different haematopoietic and non-haematopoietic 

cell types, that is surrounded by a shell of vascularized and innervated bone. a, 
Minute projections of bone (trabeculae) are found throughout the metaphysis 
such that many cells in this region are close to the bone surface. b, The interface 
of bone and bone marrow is known as the endosteum, which is covered by 
bone-lining cells that include bone-forming osteoblasts and bone-resorbing 
osteoclasts. Arteries carry oxygen, nutrients and growth factors into the bone 
marrow, before feeding into sinusoids, which coalesce as a central sinus to form 
the venous circulation. Sinusoids are specialized venules that form a reticular 
network of fenestrated vessels that allow cells to pass in and out of circulation. 
There is a particularly rich supply of arterioles, as well as sinusoids, near the 
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preferentially retains H2B-GFP and BrdU, these markers by themselves 
have very poor specificity — most bone marrow cells that retain these 
labels are not HSCs'*"°”*, 

When positive staining for CD 150 was combined with negative staining 
for CD48 and CD41, HSCs could finally be highly purified using a simple 
two-colour stain”. All serially transplantable HSCs in young adult mice 
are contained within the CD150°CD48°CD417/CD41"™ population of 
bone marrow cells, including the most quiescent HSCs**”’. This made it 
possible to localize HSCs in sections through haematopoietic tissues using 
markers validated to give high purity. Most CD150°CD48 CD41 line- 
age cells in the bone marrow and spleen localize adjacent to sinusoid ves- 
sels, and nearly all are within five cell diameters of a sinusoid” (Fig. 2). 
HSCs are five times more likely than other haematopoietic cells to be 
immediately adjacent to a sinusoid”’. HSCs are distributed throughout the 
bone marrow, with less than 20% within 10 um of the endosteum?”"”?, 
Nonetheless, most HSCs are found in the trabecular region of bone mar- 
row, suggesting that HSCs, or their niche, may be directly or indirectly 
regulated by factors present near bone surfaces. 

The frequent localization of HSCs adjacent to blood vessels suggested 
that HSCs might be maintained in a perivascular niche by endothelial or 
perivascular cells”. But HSCs are mobile, regularly entering and exiting 
circulation™. This raised the possibility that the cells observed near vessels 
were in transit, perhaps delayed from entering or exiting circulation by 


Periosteum 


endosteum. c, Three-dimensional reconstructed photomicrograph from the 
bone marrow towards the endosteal surface (blue) from 50 um below the 
surface, revealing the rich network of vessels (red) (image courtesy of C. Lin, 

J. Spencer and J. Wu). Smaller arteriolar vessels (white arrows) become larger 
sinusoidal vessels. The field of view is 350 um x 350 um. d, A cross-sectional 
view of blood vessels that run along the endosteal surface (EV) and that 
transition (white arrow) into sinusoids (S) that then course towards the central 
sinus (adapted with permission from ref. 31). e, The bone marrow is cellularly 
complex with CD150°CD48 CD41 lineage’ HSCs (arrow) residing in close 
contact not only with vascular and perivascular cells (*, sinusoid lumens) but 
also megakaryocytes (large yellow cells) and other haematopoietic cells (image 
adapted with permission from ref. 125). In the enlargement on the right, 
CD150 is shown in red and CD48, CD41 and lineage are shown in green. 
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migrating through vascular barriers. This issue could not be resolved by 
histological analysis that captures a single moment in time. 

The sequential high-resolution imaging of mice assessed the three- 
dimensional position of cells in the calvarium over time”. These studies 
indicated that primitive haematopoietic cells trafficked to specific micro- 
domains of bone marrow blood vessels where the key HSC localization 
chemokine CXCL12 and the glycoprotein E-selectin were abundant, then 
remained in these positions for weeks, generating new cells as indicated 
by the partitioning of a cytosolic dye. When HSCs were visualized after 
transplantation into irradiated mice they preferentially localized near the 
endosteum, consistent with that region being particularly relevant for 
HSC maintenance”. However, it was subsequently found that irradia- 
tion disrupts sinusoids in the bone marrow”, raising the possibility that 
the only blood vessels preserved after irradiation are the arteriolar vessels 
near the endosteum. Therefore, the peri-endosteal localization of HSCs 
in these experiments may have reflected, in part, the destruction of sinu- 
soidal niches by irradiation. Overall, the localization data emphasized 
the possibility of a perivascular niche. How could this be resolved with 
historical data suggesting that the endosteum and osteoblasts were niche 
participants? 


Osteoblasts are more harbinger than host 

Although osteoblastic cells were the first cell population shown to influ- 
ence haematopoietic stem or progenitor cell frequency when perturbed 
in vivo'*"®, several lines of evidence raised concerns that the effect may not 
be direct. First, in vivo imaging studies using validated markers or labelled 
stem cells found few HSCs in contact with osteoblastic cells’”*"*****. Sec- 
ond, studies that depleted osteoblasts by Bgn deficiency” or osteoblastic 
cells by treatment with ganciclovir” or that increased osteoblasts by 


strontium treatment” had no acute effect on HSC frequency. The stud- 
ies in which osteoblastic cells were conditionally deleted by ganciclovir 
showed acute depletion of B lymphoid progenitors and only later showed 
a decline in a stem/progenitor cell population””’. Third, genetic modifi- 
cation of primitive osteolineage cells had an effect on HSC proliferation 
and differentiation, but the same modification in mature osteoblasts did 
not™. Finally, a key adhesion molecule thought to mediate osteoblast— 
HSC interaction, N-cadherin, was called into question. 

N-cadherin* HSCs were proposed to adhere to N-cadherin’ osteoblasts 
by homophilic adhesion’®”*, promoting HSC maintenance**; how- 
ever, these studies did not test whether deletion of the gene that encodes 
N-cadherin (Cdh2) affected HSC function. The levels of N-cadherin 
staining in HSCs were difficult to distinguish from background fluores- 
cence and depended on anti-N-cadherin antibodies that gave nonspe- 
cific staining in some haematopoietic cells**. Other studies failed to detect 
N-cadherin expression by HSCs using gene expression profiling”’””"* 
(https://gexc.stanford.edu/model/3/gene/Cdh2) quantitative reverse- 
transcription-PCR, flow cytometry with multiple anti-N-cadherin anti- 
bodies, western blot, or Cdh2:LacZ genetrap mice!®*’58, Conditional 
deletion of Cdh2 from HSCs or from osteoblast lineage cells had no effect 
on HSC frequency, HSC function or haematopoiesis**”*. Collectively, 
these data undermined the notion of an N-cadherin* ‘osteoblastic niche. 

Is there any role for osteoblasts or osteolineage cells in HSC regulation? 
Several lines of evidence suggest that this possibility remains viable but 
not as it was initially foreseen. First, higher numbers of HSCs reside in the 
trabecular rich metaphysis’’*’. This may simply reflect other components 
of bone marrow co-localizing with bony surfaces; however, conditional 
deletion of Sp7 (Osterix) results in chondrocytes without osteoblastic dif- 
ferentiation, increasing blood vessels and mesenchymal progenitors in the 


Figure 2 | Haematopoietic stem cells and their niche cells surround 
sinusoids throughout the bone marrow. a, Sections through the bone 
marrow of mice in which GFP had been knocked in to the Scflocus show that 
Scf-expressing cells (green) include mesenchymal stromal cells and endothelial 
cells that surround sinusoids and potentially other blood vessels throughout the 
bone marrow™. High magnification shows that b, Scf-GFP expression overlaps 
with c, the endothelial marker endoglin (shown by d, a merge of the two) 

but also extends beyond the endoglin on the abluminal side of the sinusoids, 
indicating expression by mesenchymal stromal cells. e, Scf-GFP (green) is not 


expressed by osteopontin” bone-lining cells (red) around trabecular bone, but 

it is expressed by some nearby perivascular cells. f, Cxcl12-DsRed (red) exhibits 
a similar expression pattern, primarily by perivascular mesenchymal cells and 
endothelial cells around sinusoids throughout the bone marrow, in a pattern 
that strongly overlaps with Scf-GFP (green) in mice with DsRed knocked into 
the Cxcl12 locus and GFP knocked into the Scflocus’”. Cells that are (g) CD150* 
and (h) CD48 and lineage’ are usually found immediately adjacent to (i) Scf- 
GFP" perivascular cells in the bone marrow. j, A merge of g-i. Images adapted 
with permission from refs 17 and 64. 
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Figure 3 | Haematopoietic stem cells (HSCs) and restricted haematopoietic 
progenitors occupy distinct niches in the bone marrow. HSCs are found 
mainly adjacent to sinusoids throughout the bone marrow”***!*, where 
endothelial cells and mesenchymal stromal cells promote HSC maintenance 
by producing SCF™, CXCL12 (refs 17, 33, 62) and probably other factors. 
Similar cells may also promote HSC maintenance around other types of blood 
vessels, such as arterioles. The mesenchymal stromal cells can be identified 
based on their expression of Lepr-Cre™, Prx1-Cre”, Cxcl12-GFP*’ or Nes-GFP 
transgenes” in mice and similar cells are likely to be identified by CD146 
expression in humans™. Perivascular stromal cells, which probably include 
Cxcl12-abundant reticular (CAR) cells®, are fated to form bone in vivo, 
express Mx-1-Cre and overlap with CD45/Ter119 PDGFRa’Sca-1* stromal 
cells that are highly enriched for mesenchymal stromal cells in culture”. It is 
likely that other cells also contribute to this niche, these probably include cells 
near bone surfaces in trabecular-rich areas. Other cell types that regulate HSC 
niches include sympathetic nerves’'”’, non-myelinating Schwann cells (which 
are also Nes*)”*, macrophages” and osteoclasts”. The extracellular matrix'™'”" 
and calcium” also regulate HSCs. Osteoblasts do not directly promote HSC 
maintenance but do promote the maintenance and perhaps the differentiation 
of certain lymphoid progenitors by secreting CXCL12 and probably other 
factors'*"”*?“°. Early lymphoid restricted progenitors thus reside in an 
endosteal niche that is spatially and cellularly distinct from HSCs. 


bone marrow but almost eliminating haematopoiesis in the metaphysis™. 
These data argue that the presence of mature or maturing osteolineage 
cells in regions with abundant endosteum is crucial for haematopoiesis. 
Indeed, mesenchymal progenitors that are capable of forming bone are 
sufficient to create bony ossicles that become invested by host vasculature 
and HSCs**™. This suggests that bone or bone-forming progenitors can 
promote the formation or maintenance of HSC niches (for example by 
recruiting vasculature to the bone marrow) even if they do not directly 
promote HSC maintenance. 

Transplanted haematopoietic stem or progenitor cells preferentially 
localize to blood vessels in endosteal regions even without prior cytotoxic 
conditioning”. Within the endosteal region, transplanted HSCs position 
themselves closer to the endosteal surface than progenitor cells”. These 
may again reflect indirect effects of bone-forming osteolineage cells as 
bone turnover results in high local concentrations of ionic calcium, and 
the calcium-sensing receptor promotes bone marrow engraftment by 
HSCs during development or after transplantation”. Osteolineage cells 
also elaborate cytokines and extracellular matrix proteins that may influ- 
ence a wide range of cell types, some of which may directly regulate HSC 
function. This is exemplified by parathyroid hormone receptor activa- 
tion, which induces expression of multiple regulatory molecules (such as 
IL-6, RANKL and Jagged1) by osteoblasts that can influence other cells 
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in the bone marrow, including the vasculature’*””. In addition, osteo- 
blastic expression of transgenes encoding the Wnt antagonists Dkk] and 
Wifl depletes HSCs**. Finally, the depletion of osteocalcin-expressing 
cells (osteoblasts or osteocytes) in vivo results in an inability to mobi- 
lize at least short-term repopulating cells to the blood using granulocyte 
colony-stimulating factor™*' despite osteoblasts having little expression 
of CXCL12 (refs 17, 33, 62). In aggregate, these data indicate that the 
endosteal region is important for haematopoiesis, but the mature osteo- 
lineage cells probably have an indirect role in modulating HSCs. However, 
these cells seem to be more important in directly regulating restricted 
progenitors, a topic discussed later. It is important, therefore, to refocus 
attention on the endosteum as a regulatory region and not on the osteo- 
blasts themselves (Fig. 3). 

The endosteum has a diverse group of cells and anatomical elements, 
including a rich endowment of arteriolar and sinusoidal blood vessels" 
(Fig. 1). The cells include endothelial cells as well as mesenchymal cells 
with osteolineage potential. These mesenchymal cells reside perivascu- 
larly but traffic to the endosteal surface to differentiate into osteoblasts. 
Undifferentiated mesenchymal cells around blood vessels may promote 
HSC maintenance throughout the bone marrow, but the mesenchymal 
cells around vessels in the endosteal region may differ from those distant 
from endosteal surfaces. 


Perivascular regulators of HSCs 

Given the localization of HSCs near blood vessels, it was crucial to define 
the stromal cells surrounding the vessels and to test whether they promote 
HSC maintenance. Attention focused on the mesenchymal cells that sur- 
round blood vessels throughout the bone marrow. Although mesenchy- 
mal stroma are likely to be heterogeneous, and the precise relationships 
between cells expressing various markers remain to be defined, perivascu- 
lar mesenchymal cells that express CD146 in humans™ and Cxcl12-GEP*, 
Nes-GFP®, full length Lepr-Cre™, Prx-1-Cre”, Sp7-Cre” and inducible 
Mx-1-Cre"* mice all generate osteoblastic cells and all express factors that 
promote HSC maintenance. CXCL12-abundant ‘reticular’ (CAR) cells 
adjacent to sinusoids were first shown to co-localize with HSCs through- 
out the bone marrow”. Ablation of Cxcl12-expressing bone marrow cells 
depletes HSCs as well as severely impairing the adipogenic and osteogenic 
capacity of bone marrow cells”. Human CD 146" skeletal stem cells also 
localize adjacent to sinusoids in the bone marrow and synthesize high lev- 
els of the HSC niche factors stem cell factor (SCF) and CXCL12 (ref. 54). 

The possibility that mesenchymal stem or stromal cells (MSCs) are 
part of the HSC niche was further supported by the finding that MSCs 
in the bone marrow express a Nes-GFP transgene and localize around 
blood vessels throughout the bone marrow”. HSCs commonly localize 
adjacent to Nes-GFP* cells and these cells express high levels of Scf and 
Cxcl12. Moreover, fibroblast activation protein (FAP) is expressed by bone 
marrow stromal cells that have many characteristics of MSCs, includ- 
ing Cxcl12, Scf, Pdgfra and Sca-1 expression”, and ablation of these 
FAP’ cells leads to bone marrow hypocellularity, anaemia and depletion 
of osteogenic cells*”. These studies provided strong evidence that MSCs 
are one component ofa perivascular niche for HSCs. 

Endothelial cells also contribute to the perivascular HSC niche”. The 
earliest functional evidence supporting this possibility was the observa- 
tion that conditional deletion of the gene that encodes the gp130 cytokine 
receptor in endothelial cells led to bone marrow hypocellularity and a 
reduction in HSC numbers”. Inhibition of VEGFR2 signalling in irradi- 
ated mice using a blocking antibody impaired the regeneration of sinusoi- 
dal endothelial cells and prevented the recovery of LSK stem or progenitor 
cells as well as CFU-S cells”’. Endothelial cells can promote HSC mainte- 
nance in culture”, and bone marrow sinusoidal endothelial cells promote 
long-term reconstituting HSC expansion in culture”. It has been sug- 
gested that E-selectin is exclusively expressed by endothelial cells in the 
bone marrow, and deficiency of the gene that encodes it renders HSCs 
more quiescent and resistant to irradiation”. These studies suggested that 
endothelial cells are one component of the HSC niche, but did not address 
whether they directly or indirectly regulate HSC maintenance in vivo. 
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To formally identify the niche cells, researchers examined which cell 
populations were the key sources of factors that promote HSC mainte- 
nance in vivo. For example, SCF has a non-cell-autonomous role for HSC 
maintenance in vivo”. Differential splicing and proteolytic cleavage 
yield membrane-bound and soluble forms of SCE HSCs are depleted in 
SI/SF mutant mice®, which express soluble SCF but not the membrane- 
bound form, indicating that this form is necessary for HSC maintenance”. 
Importantly, mice with a mixture of wild-type and S//SI stromal cells only 
exhibit normal haematopoiesis in the immediate vicinity of the wild-type 
cells, demonstrating that SCF acts locally in creating the niche. Because 
cell-cell contact is needed between HSCs and those that synthesize SCF, 
the niche could be localized by identifying the key sources of SCF for 
HSC maintenance. 

Analysis of the Scf expression pattern in mice in which GFP is knocked 
in to the Scflocus revealed that Scfis expressed perivascularly, mainly 
around sinusoids throughout the bone marrow™. Lepr’ perivascular stro- 
mal cells expressed the highest levels of Scfand endothelial cells expressed 
lower levels. Gene expression profiling suggested that these Lepr-express- 
ing perivascular cells were mesenchymal. Scf-GFP expression could not 
be detected in osteoblasts or in haematopoietic cells. Conditional dele- 
tion of Scf from perivascular stromal cells (Lepr-Cre) or endothelial cells 
(Tie2-Cre) depleted HSCs™. However, deletion of Scf from haematopoi- 
etic cells (Vav1-Cre), osteoblastic cells (Col2.3-Cre) and Nestin-expressing 
perivascular stromal cells (Nes-Cre and Nes-CreER) did not affect HSC 
frequency™. These results proved there is a perivascular niche for HSCs 
in which endothelial cells and mesenchymal cells promote HSC mainte- 
nance by synthesizing SCF (Fig. 3). 

It has been proposed that the endosteal region and its osteoblastic cells 
provide a unique zone for the maintenance of quiescent HSCs. How- 
ever, when Scf was conditionally deleted from both endothelial cells and 
perivascular mesenchymal cells in Lepr-cre, Tie2-cre, Scf"” mice, 85% 
of all long-term multilineage reconstituting cells, including all serially 
transplantable HSCs and all HSCs in the most quiescent subpopulation, 
were eliminated”. Therefore, even the most primitive and quiescent HSCs 
are maintained by a perivascular niche. Whether there are functionally 
distinct perivascular niches in different regions of the bone marrow, such 
as in the endosteal region, remains an open question. 

Are other key niche factors also synthesized primarily by perivascular 
cells? Cxcl12 is a chemokine that is required for HSC maintenance and 
HSC retention in the bone marrow******. Global deletion of Cxcl12, or 
the gene that encodes the Cxcl12 receptor, Cxcr4, depletes HSCs from the 
bone marrow®***”, Cxcl12 is primarily expressed by perivascular mesen- 
chymal stromal cells (CAR cells, Nes-GFP, Lepr-Cre or Prx1-Cre express- 
ing cells), with 100-fold lower levels of expression in endothelial cells and 
1,000-fold lower levels in osteoblasts’”**°"***’. Conditional deletion of 
Cxcl12 from perivascular mesenchymal cells using Prx1-Cre and Lepr-Cre 
depleted and mobilized HSCs, respectively”. HSCs were depleted but 
not mobilized when Cxcl12 was conditionally deleted from endothelial 
cells (Tie2-Cre)'”. HSC frequency and bone marrow retention were not 
affected when Cxcl12 was conditionally deleted from osteoblasts or their 
progenitors (Col2.3-Cre and Sp7-Cre), haematopoietic cells (Vav1-Cre), 
or Nes-Cre-expressing stromal cells'”. These data confirmed that HSCs 
reside in a perivascular niche in which mesenchymal stromal cells and 
endothelial cells each synthesize multiple factors that promote HSC main- 
tenance and localization. 

Although conditional deletion of Scfand Cxcl12 with Nes-Cre and Nes- 
CreER did not have any effect on HSC frequency’”™, Nes-GFP" perivas- 
cular cells are almost certainly part of the HSC niche. Each of these 
Nes alleles are transgenes with different expression patterns in the bone 
marrow™. Nes-Cre seems not to be expressed in the bone marrow and 
Nes-CreER exhibits very limited perivascular expression that does not 
resemble the expression patterns of Scf-GFP, Cxcl12-DsRed, Nes-GFP or 
Nes-Cherry™. However, Nes-GFP expression strongly overlaps with Lepr- 
Cre expression by perivascular cells throughout the bone marrow™”””. 
Thus, it is likely that Nes-GFP" perivascular MSCs are a component of the 
HSC niche even though Nes-Cre mediated deletion of Scf or Cxcl12 did 
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not deplete HSCs. Going forward, it will be useful to identify other Cre 
alleles that are specifically expressed in Nes-GFP" cells to compare their 
function with other perivascular stromal cells. 


Complexity of the perivascular HSC niche 

Endothelial cells and mesenchymal stromal cells are not the only cell 
types that regulate the perivascular HSC niche (Fig. 3). The sympathetic 
nervous system regulates CKCL12 expression and HSC retention in the 
bone marrow”’”’. This seems to be accomplished by sympathetic nerve 
fibres that likely synapse on perivascular cells around a subset of blood 
vessels in the bone marrow, conferring circadian regulation of CKCL12 
expression and HSC mobilization. Circadian oscillation in the clearance 
ofaged neutrophils by macrophages in the bone marrow also contributes 
to these circadian changes in CXCL12 expression and HSC circulation”. 
Consistent with this, macrophages modulate Cxcl12 expression by Nes- 
GFP* cells and HSC retention in the bone marrow™”*. Non-myelinating 
Schwann cells seem to regulate the niche by regulating TGF-f activation 
and potentially by secreting other factors”*. Osteoclasts, or osteoclast 
activity at the endosteum, may also influence HSC maintenance and bone 
marrow retention””*. Many different cell types are likely to directly or 
indirectly regulate the perivascular HSC niche. 

Given the complexity of cell types implicated in the regulation of HSCs, 
there is no singular niche cell. Rather, the niche integrates the function of 
multiple participants. It is important to bear in mind that niche composi- 
tion and niche function may change under different physiological condi- 
tions or in response to stress. It is also important to note that many of the 
Cre recombinase alleles used so far to study niche cells were active during 
development. Although this was necessary to achieve efficient gene dele- 
tion (temporally regulated CreER alleles tend to give much lower levels of 
recombination) and no abnormalities in development were noted, indi- 
rect effects on surrounding cell types and compensatory changes cannot 
be excluded. Even though endothelial cells and perivascular mesenchymal 
cells express SCF and CXCL12, conditional deletion of these factors from 
these cell types may have direct and indirect effects on HSCs. 

There may also be long-range signals circulating through the blood that 
regulate HSC or niche function, perhaps integrating stem-cell activity 
with overall physiology”. These may include hormones that signal repro- 
ductive or nutritional status, or even haematopoietic cytokines. For exam- 
ple, thrombopoietin is required for HSC maintenance". The main 
sites of thrombopoietin synthesis are in the liver and kidney, although it is 
also synthesized at lower levels by bone marrow stroma’”*'*, Conditional 
deletion experiments will be required to determine the physiologically 
important source or sources of thrombopoietin for HSC maintenance. 

There may also be functionally distinct perivascular environments 
in the bone marrow based on vessel type. Most studies of perivascular 
niches in the bone marrow have focused on sinusoids because they are the 
most abundant blood vessels in the bone marrow and most HSCs, MSCs, 
Scf-expressing cells and Cxcl12-expressing cells are in close proximity to 
them!”*”*35+!, However, other types of blood vessel, such as arterioles, 
may have an important role in HSC maintenance. A recent study reported 
that NG2*, but Lepr , mesenchymal cells that surround arterioles in the 
bone marrow are important for the maintenance of quiescent HSCs". 
This conclusion was based on the observation that HSCs were depleted 
and driven into cycle when NG2-CreER’ cells were ablated by treatment 
with diphtheria toxin. However, these data seem to conflict with the ear- 
lier observation that quiescent HSCs are eliminated from the bone mar- 
row when Scfis conditionally deleted using Tie2-Cre and Lepr-Cre, which 
recombine in endothelial cells and mesenchymal cells that are primarily 
around sinusoids throughout the bone marrow”. It will thus be interesting 
to determine whether HSCs are depleted when Scfis conditionally deleted 
using NG2-CreER or whether NG2-CreER is expressed by cells other than 
periarteriolar cells in the bone marrow. Similarly, it will be important to 
assess whether Lepr-expressing perivascular cells contribute to arteriolar 
niches in the bone marrow. In the end, perivascular niches associated 
with both sinusoids and arterioles may regulate HSC maintenance and 
quiescence in the bone marrow. Dissecting the diversity in perivascular 
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environments will require Cre alleles that are specifically expressed within 
distinct perivascular domains to map their functions. 

Evidence of HSCs residing within relatively hypoxic domains within 
the bone marrow" has partly been based on staining with pimonida- 
zole’'. Stained HSCs often reside adjacent to sinusoids in the bone marrow 
and are found next to cells that do not stain with pimonidazole’’. This 
suggests that pimonidazole staining does not reflect ambient oxygen or 
that it is cell-autonomously determined, rather than reflecting a hypoxic 
environment. Pimonidazole responds to reducing intermediates and may 
reflect more about the metabolic state of cells than ambient oxygen levels. 

Some researchers have interpreted the dependence of HSC mainte- 
nance on HIF-1a as suggestive of a hypoxic niche’”*. However, anum- 
ber of factors other than hypoxia regulate HIF-1a expression. A recent 
imaging study using a nanoprobe specifically reflective of ambient oxygen 
found that oxygen tension was lowest around sinusoids and highest near 
the endosteum™”. The entire bone marrow space had much lower levels 
of oxygen compared with vessels entering the bone marrow, a feature that 
was largely lost when haematopoiesis was ablated by cytotoxic drugs. It 
is therefore likely that consumption of oxygen during haematopoiesis 
renders the marrow hypoxic but that no distinct hypoxic region exists 
at the endosteum. 


Distinct haematopoietic progenitors have distinct niches 
HSCs reside within a specialized niche that is distinct from the niches that 
nurture other haematopoietic progenitors. For example, although osteo- 
lineage cells do not directly regulate HSC maintenance, they do regulate 
some B-cell lineage progenitors. Cultures enriched for osteoblasts support 
B lymphopoiesis and ablation of osteoblastic cells in adult mice acutely 
depletes some B lymphoid progenitors“””. Deletion of G,a in osteoblastic 
cells, which is necessary for parathyroid hormone receptor signalling, 
markedly depleted pro- and pre-B cells in a way that could be mitigated 
with IL-7 (ref. 111). About 30% of IL7R'lineage’ bone marrow cells, 
which are enriched for early lymphoid progenitors, localize immediately 
adjacent to bone-lining cells at the endosteum””. Conditional deletion 
of CXCL12, a factor that promotes the proliferation and maintenance of 
B-lineage progenitors**"’ and common lymphoid progenitors (CLPs)'”’, 
in osteoblastic cells depleted CLPs and certain other early lymphoid pro- 
genitors from the bone marrow without any effect on HSCs”. Therefore, 
some early lymphoid progenitors depend on an osteoblastic niche that 
is cellularly and functionally distinct from the perivascular niche that 
maintains HSCs. 

Other lineage-restricted niches may also exist. For example, mac- 
rophages seem to be crucial for erythroid maturation, and macrophage 
depletion reduces normal and malignant erythropoiesis'™*. Other cel- 
lular components of the erythropoiesis niche will have to be identified to 
understand the relationship between this niche and HSC and lymphoid 
progenitor niches. 

The approach of conditionally deleting specific niche factors from 
candidate niche cells and then examining the consequences for stem or 
progenitor cell maintenance in vivo offers the opportunity to map the 
niches for each stem cell and restricted progenitor in the haematopoietic 
system, limited only by the precision of the Cre alleles that are available. 


Novel niche factors 
In contrast to stem cells in some tissues, HSCs cannot be sustainably 
expanded in culture. This has impeded our ability to safely and effectively 
transplant HSCs in certain clinical contexts, such as during gene therapy, 
in which it would be useful to expand transfected HSCs in culture and 
verify the quality of the transfected HSCs before transplantation. One 
possibility why HSCs cannot be expanded in culture is the existence of, so 
far, unidentified growth factors that are synthesized by the niche in vivo. 
Some HSC niche factors have only recently been discovered. The addi- 
tion of pleiotrophin to culture promotes HSC maintenance'” and Pin 
deficiency is associated with HSC depletion and impaired haematopoietic 
regeneration after myelosuppression'’*. Pleiotrophin is synthesized by 
sinusoidal endothelial cells and Cxcl12-expressing perivascular stromal 
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cells, and has a non-cell-autonomous role in promoting HSC function"®. 
The Slit receptor Robo4, which is expressed by HSCs and endothelial cells, 
regulates HSC localization in the bone marrow'”"*. The Slit2 ligand is 
restricted to MSCs and possibly other osteoblast lineage cells. This sug- 
gests that pleiotrophin and Robo4-Slit2 are important elements of the 
perivascular niche. The glycoprotein tenascin-C'”’, osteopontin'’*”’”' and 
non-canonical Wnts” have also been reported to positively or negatively 
influence HSC numbers in the bone marrow and are among a number 
of factors that bear further characterization in terms of cellular source or 
role with respect to the niche. 


Perspective 

Ten years of experimentation has validated the niche concept and 
answered some first order questions about the molecular and cellular 
nature of the HSC niche in the bone marrow. The ‘parts’ list that make 
up this niche remains incomplete, but with the pace of current work it is 
likely that additional components will be defined and ambiguity about 
overlapping cell populations resolved over the next few years. This will 
make it possible to compare anatomically and developmentally distinct 
HSC niches that have different functions. The number of HSCs expands 
daily within the fetal liver but is sustained at nearly constant levels in the 
bone marrow, at least in the absence of injury. How components of these 
niches compare may inform methods for achieving HSC expansion. Simi- 
larly, comparing homologous niches among species, such as long-lived 
humans with short-lived mice, may provide insight into mechanisms for 
preserving the integrity of haematopoiesis under stress or in response to 
ageing. Finally, comparing niches among tissues will assess whether the 
mesenchymal and endothelial populations in brain, gut and skin share 
characteristics and functions with those defined in the bone marrow. Do 
diverse adult tissues consistently have perivascular niches for stem-cell 
maintenance? Do regenerative tissues have niches with common mecha- 
nisms for preserving self-renewal? Are there common components that 
can be engineered into niches ex vivo? 

With the detail now emerging in our understanding of the bone mar- 
row niche, a number of second order questions can be addressed. Increas- 
ingly, niche cells can be genetically tagged or modified, allowing both 
quantification and molecular manipulation. Coupled with high resolu- 
tion real-time imaging and well-validated methods to measure haemat- 
opoiesis, it is becoming possible to systematically elucidate how the niche 
responds to stresses or physiological changes to mediate changes at the 
stem-cell and tissue levels. When stressed by infection, myeloablation 
or neoplasia, what niche components change in number or function to 
modify haematopoiesis? Is there a hierarchy of niche components that 
determine these responses? Can such information allow predictive algo- 
rithms that guide specific interventions to achieve desired outcomes? 

Another set of questions concerns the manner in which the niche 
participates in diseases of stem-cell failure, such as aplastic anaemia or 
neoplasia. The niche may be hostile to normal progenitors in those dis- 
ease states and, with neoplasia, undergo a facultative response to support 
altered haematopoiesis’”. Can changes in the niche be a primary but non- 
cell-autonomous driver of neoplasia in humans as has been suggested by 
animal models*””*'”“? The potential for unravelling how the microen- 
vironment participates in normal and disease physiology is at hand and 
promises new approaches to haematological disorders. m 
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Mitochondrial form 
and function 


Jonathan R. Friedman! & Jodi Nunnari! 


Mitochondria are one of the major ancient endomembrane systems in eukaryotic cells. Owing to their ability to produce 
ATP through respiration, they became a driving force in evolution. As an essential step in the process of eukaryotic evo- 
lution, the size of the mitochondrial chromosome was drastically reduced, and the behaviour of mitochondria within 
eukaryotic cells radically changed. Recent advances have revealed how the organelle’s behaviour has evolved to allow the 
accurate transmission of its genome and to become responsive to the needs of the cell and its own dysfunction. 


ment of an a-proteobacterium by a precursor of the modern 

eukaryotic cell’. Although mitochondria have maintained the 
double membrane character of their ancestors and the core of ATP pro- 
duction, their overall form and composition have been drastically altered, 
and they have acquired myriad additional functions within the cell. As 
part of the process of acquiring new functions during evolution, most of 
the genomic material of the a-proteobacterium progenitor was rapidly 
lost or transferred to the nuclear genome’. What remains in human cells 
is a small, approximately 16 kilobase, circular genome, which is present in 
cells in a vast excess of copies relative to nuclear chromosomes. 

The human mitochondrial genome contains genetic coding informa- 
tion for 13 proteins, which are core constituents of the mitochondrial 
respiratory complexes I-IV that are embedded in the inner membrane. 
Functioning together with the Krebs’ cycle in the matrix, the respiratory 
chain creates an electrochemical gradient through the coupled transfer 
of electrons to oxygen and the transport of protons from the matrix 
across the inner membrane into the intermembrane space. The elec- 
trochemical gradient powers the terminal complex V of the chain, ATP 
synthase, which is an ancient rotary turbine machine that catalyses the 
synthesis of most cellular ATP. The electrochemical potential is har- 
nessed for additional crucial mitochondrial functions, such as buffering 
the signalling ion Ca through uptake by a uniporter in the inner mem- 
brane**. A reduction in the electrochemical potential of mitochondria 
in cells has evolved as a read-out for mitochondrial functional status, 
which, as discussed later, creates signals to activate pathways that repair 
and/or eliminate defective mitochondria. 

We know from a combination of proteomics, genomics and bioin- 
formatics that modern day mitochondria are comprised of well over 
1,000 proteins; the composition is plastic in nature, varying with and 
between species in response to cellular and tissue-specific organismal 
needs” ’. The origin of the mitochondrial proteome is a mixture of 
‘old’ bacterial and ‘new’ eukaryotic-derived proteins’. For example, the 
mitochondrial DNA (mtDNA) replication and transcription machines 
have distinct evolutionary origins in bacteriophage*"°, whereas the 
mitochondrial translational machinery has a clear evolutionary rela- 
tionship to bacteria'’. In addition to protein components, the mito- 
chondrial genome encodes 22 transfer RNAs and 2 mitochondrial 
ribosome-coding RNAs, which are essential components of its own 
translational apparatus. Mitochondrial ribosome assembly in the mito- 
chondrial matrix is a relatively complex and highly regulated process, 
which involves mitochondrial ribosome-coding RNA processing and 


Meester arose around two billion years ago from the engulf- 


maturation and the assembly of mitochondrial ribosomal proteins into 
small and large subunits'*. However, only a fraction of mitochondrial 
ribosome proteins have identifiable homologues in bacteria’’. The roles 
of mitochondrial specific ribosomal proteins are not understood, but 
these proteins are thought to have evolved to regulate the coordination 
of mitochondrial translation with extra-mitochondrial pathways in 
eukaryotic cells. Thus, like many mitochondrial machines, the ribo- 
some is a mix of old and new innovations. 

The nucleus-encoded proteins that make up most of the mitochon- 
drial proteome are translated on cytosolic ribosomes and actively 
imported and sorted into mitochondrial sub-compartments by 
outer and inner membrane translocase machines in a manner that is 
dependent on the electrochemical potential’*"*. Transcriptional, post- 
transcriptional and post-translational modes of regulation exist for 
nucleus-encoded mitochondrial proteins. In humans, transcriptional 
regulation of mitochondrial biogenesis occurs through the action of the 
PGC-1 family of co-activators, which respond to changes in nutrient 
status, such as NAD*/NADH and AMP/ATP ratios (sensed through 
SIRT1 and AMPK, respectively), as well as environmental signals'®””, 
Combinatorial interactions between PGC-1 co-activators and specific 
transcription factors (NRF1, NRF2 and ERR) balance and specify the 
major functional pathways within mitochondria. Through the induc- 
tion of nuclear genes that directly impinge on the maintenance of 
mtDNA, these interactions coordinate the nuclear and mitochondrial 
genomes’*. Evidence in yeast suggests that nuclear-transcribed messen- 
ger RNAs encoding mitochondrial proteins are post-transcriptionally 
localized to the mitochondrial outer membrane in a highly regulated 
spatial and temporal manner, and coordinately translated’*””. Although 
the underlying molecular mechanisms of mRNA targeting to mitochon- 
dria are poorly understood, such pathways will probably be important in 
polarized cells such as neurons. Post-translational modifications, such 
as phosphorylation of mitochondrial import machinery components 
by cytosolic kinases, ultimately fine-tunes the proteome in response to 
metabolic cues”. 

Mutations in either mtDNA genes or nuclear genes that encode the 
mitochondrial proteins required for aerobic ATP production cause a 
diverse and often devastating array of human mitochondrial diseases, 
which can affect any organ in the body at any point during a person’s 
life’. In addition, there is a high degree of clinical heterogeneity in 
mitochondrial diseases. Some of this heterogeneity can be explained 
by the fact that human cells can contain a variable ratio of mutated and 
wild-type mtDNA, a state called heteroplasmy. This seems to be the 
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Figure 1 | The organization and distribution of mitochondria and mtDNA 
in higher eukaryotes. Mitochondrial organization is a conserved feature. 
a, mtDNA in a human fibroblast is packaged within nucleoids (green) 
distributed within tubular mitochondria (red) around the nucleus 

(blue). Scale bar, 20 microns. Adapted with permission from ref. 40. b, A 
similar distribution is seen ina yeast cell with nucleoids (green) within 
mitochondria (red). Scale bar, 2 microns. c, Mitochondrial copy number is 


case for mtDNA mutations in protein-coding regions of the mitochon- 
drial genome, in which an increase in mutant load gives rise to more 
severe disease phenotypes. However, disease heterogeneity that cannot 
be explained by heteroplasmy also exists when mutations are present 
in non-coding mitochondrial tRNAs. In addition, mutations in genes 
with a shared function, such as genes encoding subunits of complex I 
of the respiratory chain, lead to disease manifestations that are vastly 
different, such as optic nerve atrophy in adults or encephalopathy in 
infants~*™*. Recent studies also point to a causal link between muta- 
tions in mtDNA and ageing, probably resulting from mtDNA-linked 
defects in somatic stem cells””*. 

The role of mitochondria in disease has been expanded beyond 
the respiratory chain, as defects in additional mitochondrial func- 
tions and behaviours have been linked to cancer, metabolic disorders 
and neurodegenerative diseases, such as Alzheimer’s, Parkinson’s 
and Huntington's disease”. In general, however, our current under- 
standing of the underlying relationship between mitochondrial phe- 
notype and disease is poor and requires a better understanding of 
mitochondrial organization, as well as the connections mitochondria 
have with the nuclear genome and extra-mitochondrial pathways in 
different cell types and at the organismal level. To address this defi- 
cit, a renaissance in mitochondrial research has emerged, hastened 
by recent advances in genetics, systems-based approaches and our 
ability to visualize mitochondria at high temporal and spatial resolu- 
tion. In this Review, we discuss how this renaissance is both redefin- 
ing and extending our knowledge about mitochondrial behaviour 
and communication. 
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controlled by the combined actions of mitochondrial division and fusion. 
Mitochondrial division is controlled by the assembly of a dynamin-related 
protein (DRP) on the outside of the organelle into a helical structure, 
which mediates scission through interactions across helical rungs (marked 
by orange circles). Mitochondrial fusion is controlled by interactions of 
outer and inner membrane fusion DRPs (blue and red circles, respectively) 
across mitochondrial membranes. 


The mitochondrial chromosome 

Given the importance of mtDNA-encoded genes for mitochondrial 
function, it is not surprising that there are dedicated mechanisms that 
actively control the structure and distribution of mitochondria and 
mtDNA, but in higher eukaryotes, these mechanisms are divergent from 
those of their ancestors”. Unlike bacteria, in most cell types, individual 
mitochondria do not exist; instead, they comprise a connected network 
containing multiple copies of the mitochondrial chromosome, form- 
ing a ‘syncytium’ (Fig. 1a, b). Like bacterial and nuclear chromosomes, 
mtDNA is highly compacted within the mitochondrial matrix, and 
consequently mtDNA-protein complexes can be visualized as punctate 
structures, termed nucleoids, within networks. The mechanism under- 
lying mtDNA condensation was illuminated by researchers who solved 
the crystal structure of the most abundant mtDNA-associated protein 
in mammalian cells, mitochondrial transcription factor A (TFAM). The 
structure indicates that TEAM both binds and bends short stretches of 
mtDNA, forming loops that allow mtDNA packaging””*. TEAM also 
plays a crucial part in mtDNA transcription, and its expression controls 
mtDNA copy number in cells, making it a central player in mtDNA 
maintenance and transmission”. 

Additional proteins that are crucial for mtDNA maintenance are 
localized at nucleoids. These proteins include replication and repair 
machinery, which, in humans, includes DNA polymerase y and its 
accessory proteins, such as the replication helicase twinkle”. Mutations 
in genes encoding these and additional factors required for mtDNA 
maintenance cause a spectrum of human mitochondrial diseases, but 
at the molecular level, it is still not understood how these and the many 
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other identified mtDNA-associated proteins are assembled together and 
organized to build nucleoids*. This is despite the fact that proteomic 
inventories of mtDNA-associated proteins have been determined for 
many species®**. In addition, both higher order nucleoid organiza- 
tion and the mode of mtDNA replication are variable between biologi- 
cal kingdoms, resulting in further complexity. These differences are a 
consequence of the composition of the genome and the nucleoid. For 
example, yeast possesses an active recombination machine similar to 
Rad52 recombination proteins, and replication is probably primarily 
recombination driven”. As a result, relative to the human mitochondrial 
genome, the larger 80 kilobase yeast genome is packaged in multiple 
copies in the nucleoid. By contrast, the mechanism of human mtDNA 
replication is recombination independent in most tissues and occurs 
through strand displacement”. Super-resolution imaging consistently 
indicates that human nucleoids contain a relatively small number of 
mtDNA molecules and thus are more solitary in nature”. These dif- 
ferences in organization and transmission modes of the mitochondrial 
chromosome greatly impact the segregation behaviour of mtDNA. 


Modes of mtDNA segregation 

The multi-copy nature of mtDNA means the mode of mtDNA trans- 
mission is viewed as stochastic or ‘relaxed’ in most cell types and thus is 
radically different from that of nuclear genes“. On an organismal level, 
in humans for example, mtDNA is inherited in a uniparental maternal 
manner, and paternal mtDNA is actively destroyed after fertilization”. 
Mitochondria in egg and sperm also have different functional states, 
shapes and cellular distributions, and these differences are probably 
important to confer fitness. In addition, from generation to generation 
mtDNA genotypes rapidly segregate, indicating that a ‘bottleneck’ exists 
to potentially eliminate severely defective mitochondria and/or mtDNA. 
In oocytes, the bottleneck is partly due to the manner in which mtDNA 
is replicated, as well as a consequence of mitochondrial organization. 
Oocyte mitochondria are organized into a transient structure called a 
Balbiani body, comprised of other organelles, such as the endoplasmic 
reticulum (ER) and Golgi, but the biogenesis of this structure is poorly 
understood™. During the reprogramming of fibroblasts into induced 
pluripotent stem (iPS) cells, heteroplasmic mtDNA genotypes also 
segregate through a bottleneck and mitochondria are organized into a 
Balbiani-like structure®, suggesting that iPS cells could be a useful tool 
with which to study the cellular and molecular mechanisms of mtDNA 
genotype segregation during development. 

Nuclear genes are replicated during a finite phase of the cell cycle 
and segregated by the concerted action of a microtubule-based spindle 
apparatus and an actin-based cytokinesis machine that work together 
to physically partition chromosomes into daughter cells. Bacterial cells 
also possess cell-cycle mechanisms to coordinate cell division with 
chromosome segregation through the placement ofa tubulin-like FtsZ 
cell division machine. By contrast, the replication and segregation of 
mitochondrial chromosomes within most eukaryotes is not strin- 
gently coupled to the cell cycle, and at any given time, the replication of 
mtDNA occurs for only a subset of nucleoids ina given cell”*. Bacterial 
cytoskeletal machinery has been lost during mitochondrial evolution, 
raising the question: what mechanisms are used to place division sites 
and segregate mtDNA? Such mechanisms will probably be important 
for understanding the cell- and tissue-specific mechanisms that underlie 
diseases linked to mtDNA mutations. 


Dynamin-mediated mitochondrial dynamics 

In higher eukaryotes, the segregation of mtDNA at the cellular level 
partly depends on continuous division and fusion events (Fig. 1c), 
whose rates are responsive to the needs of a particular cell type”. One 
fundamental role of mitochondrial fusion is to allow communica- 
tion between organelles, perhaps to facilitate access to products of 
mtDNA expression. Mitochondrial fusion also serves as a mech- 
anism to buffer transient defects that arise in mitochondria”’. Mito- 
chondrial division antagonizes fusion-driven network assembly to 
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facilitate mitochondrial distribution and transport by motor proteins on 
cytoskeletal networks to and from distal locations of demand”™. A bal- 
ance between division and fusion is important for the distribution and 
maintenance of mtDNA. Loss of mitochondrial fusion causes a normally 
connected network to fragment into multiple small mitochondria owing 
to unopposed division, and mtDNA is either completely or partially lost 
from cells with the associated severe defects in oxidative phosphoryla- 
tion®”’. Attenuation of mitochondrial division causes mitochondria 
to elongate and form highly interconnected net-like structures, as well 
as causing defects in oxidative phosphorylation and mtDNA loss dur- 
ing cell division’. The link between mitochondrial dynamics and 
mtDNA transmission is consistent with the primary role of dynamics 
in the control of mitochondrial copy number. The more distributive 
nature of mitochondrial division coupled with opposing fusion has thus 
evolved to replace ancestral bacterial cytoskeletal machines. 

Mitochondrial division and fusion events are mediated by the action 
of highly conserved dynamin-related proteins (DRPs) that, through 
their ability to self-assemble and hydrolyse GTP, facilitate membrane 
remodelling of diverse intracellular membranes”. Mitochondrial divi- 
sion is catalysed by a single DRP, DRP1 in mammals (Dnm1 in yeast). 
DRP1 and Dnm1 assemble through stalk domains into helical structures 
that wrap around the outer surface of mitochondria at constriction sites, 
whose diameters match the diameters of the division helix”. Interac- 
tions between the GTPase domains of division DRPs across the helical 
rungs catalyse the hydrolysis of GTP — an event thought to produce 
conformational changes that are transduced through the DRP helix to 
allow the coordinate scission of the outer and inner membranes”. 
Fusion of the mitochondrial outer and inner membranes requires the 
action of two evolutionarily distinct integral membrane DRPs, MFN1/ 
MEN2 in mammals (Fzol in yeast) and OPA1 in mammals (Mgm1 in 
yeast), respectively™. Much less is known about how fusion DRPs work 
at a mechanistic level, although it is likely that interactions between the 
GTPase domains on opposing membranes are harnessed for membrane 
tethering and that self-assembly through the proposed stalk-like regions 
within a membrane is used for fusion. 

The DRP family originates in bacteria, for which evidence suggests 
that members also function in membrane-linked processes™. However, 
DRPs acquired roles as the primary machines that control mitochon- 
drial copy number are a radical divergence from the bacterial FtsZ- 
dependent division machine, which works from the cytosolic face of the 
plasma membrane to mediate constriction and fission (Fig. 2). Insight 
into this transition to DRP-driven division comes from the division 
machines of primitive eukaryotic organelles from organisms such as the 
red algae Cyanidioschyzon merolae, and from endosymbiotic plastids 
and chloroplasts of most photosynthetic eukaryotes. These endosymbi- 
otic organelles possess hybrid division machines, containing both inter- 
nal FtsZ and external DRP assemblies”. The FtsZ machine functions 
in constriction and scission of the inner and outer membranes and in 
positioning the division site, whereas the DRP machine is recruited to 
the outer surface of the organelle — at the constriction site — and func- 
tions relatively late in the process to complete the scission of the outer 
membrane. As in the case for bacteria, FtsZ-dependent division site 
placement in these mitochondria and plastids is crucial for the transmis- 
sion of organellar genomes. 


ER-associated mitochondrial division 

The loss of the FtsZ-like machine in higher eukaryotes raises ques- 
tions of how and where division sites are placed in mitochondria and 
whether division site placement is important for mtDNA transmission. 
The answers have been partially addressed by the discovery that mito- 
chondrial division site placement is dependent on a key inter-organellar 
interaction with the ER. Before DRP1 recruitment to the mitochon- 
drial outer membrane, ER tubules wrap around mitochondria and mark 
sites of mitochondrial division — a phenomenon, termed ER-associ- 
ated mitochondrial division (ERMD), that has been conserved from 
yeast to humans. At these sites, mitochondria are constricted, and thus 
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geometric hotspots for the assembly of the division DRP helix are also 
created. In addition, the integral outer membrane DRP1 receptor and 
effector MFF® is recruited to sites of contact, which provides a spatial 
mark to link DRP1 recruitment to its activation and assembly into a 
division machine. Neither the mechanism underlying the generation of 
such an ER-mitochondrial microdomain nor of ER-associated mito- 
chondrial constriction is understood. It is possible that the ER is able to 
directly alter mitochondrial membrane composition and/or morphol- 
ogy to facilitate the recruitment of factors localized on the outside and/ 
or inside of mitochondria that promote mitochondrial constriction and 
division. In mammalian cells, the actin cytoskeleton has been implicated 
in ERMD, potentially through the ER-localized isoform of the formin 
INF2, raising the possibility that mitochondrial constriction during 
division is actin mediated”. 

ERMD must also involve a link or tether between the two organelles. 
The molecular basis of this link has recently been elucidated in yeast, 
and is mediated by a multiprotein complex termed the ER-mitochon- 
drial encounter structure (ERMES). This structure forms a discrete 
and finite number of interfaces between the ER and mitochondria 
in cells’’”*, In addition to marking sites of division, ERMES struc- 
tures are tightly linked to a subset of nucleoids engaged in replicating 
mtDNA“*”, potentially as components of a larger structure that spans 
multiple membranes. At sites of an ERMES complex, nucleoids seg- 
regate by an unknown mechanism and, in most cases, are distributed 
into both tips of divided mitochondria”. In this context, the ERMES 
complex has also been implicated as a bridge between mitochondria 
and the actin network, suggesting that it may link and coordinately 
drive nucleoid segregation, mitochondrial constriction during divi- 
sion, and mitochondrial distribution after division”. Thus, the process 
of ERMD and nucleoid segregation in yeast may fundamentally be 


a Bacterial division b 
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Figure 2 | Evolution of mitochondrial division site placement 
mechanisms. a, Roles of FtsZ and dynamin-related proteins (DRPs) in 
bacterial and endosymbiotic organelle division and division site placement. In 
the a-proteobacterial ancestor of mitochondria, an FtsZ ring is placed mid-cell 
by active mechanisms. The combined actions of the FtsZ-containing ring 

and cell-wall synthesis are essential for cell division. In chloroplasts and algal 
mitochondria, FtsZ-dependent placement (indicated by arrows) and division 
mechanisms on the inside of the organelle have been retained during evolution. 
However, in these organelles, DRPs also function on the cytosolic surface in 
organelle division, perhaps replacing the requirement for cell-wall synthesis in 
division. In yeast or animals, DRPs function on the cytosolic surface in organelle 
division. Before DRP recruitment, however, the endoplasmic reticulum (ER) is 
associated with division-site placement and constriction on the outside of the 
organelle, potentially replacing FtsZ-dependent placement and constriction. 
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related to the role of actin in ERMD in mammalian cells. 

The distribution of daughter mitochondria following ERMD in yeast 
requires the highly conserved Miro GTPase Gem] (ref. 72). Gem1 may 
function with ERMES to promote the resolution of daughter mitochon- 
dria by recruiting motility factors to mitochondrial tips after division. 
The metazoan Gem1 orthologues, MIRO1 and MIRO2 (hereafter 
referred to as Miro), also function in mitochondrial distribution. In 
this case, Miro proteins have been proposed to connect mitochondria 
to amember of the Milton/TRAK protein family of kinesin-1 adaptors 
to allow the microtubule-based transport of mitochondria’. However, 
although the Miro GTPase family is remarkably conserved, the mecha- 
nisms of mitochondrial transport are divergent in eukaryotes and, in 
yeast, mitochondrial motility is actin dependent. Thus, it is possible 
that the role of Gem1 and Miro GTPases in mitochondrial distribu- 
tion may instead be to allow motility by directly regulating molecular 
tethers to disengage mitochondria from the ER at sites of division. In 
any case, Gem1-dependent distribution of daughter mitochondria after 
mitochondrial division serves as a cellular mechanism to coordinately 
distribute mitochondria and mtDNA. 


Internal determinants of mitochondrial division 

The observations discussed paint a compelling picture in which ERMD 
positions the division plane adjacent to mitochondrial nucleoids to bias 
their distribution into newly generated daughter mitochondria (Fig. 2). 
In mammalian cells, nucleoids are similarly localized at mitochondrial 
division sites and mitochondrial tips’””, and in the absence of the divi- 
sion DRP DRP1, nucleoids aggregate in clusters within hyperfused 
mitochondria”. This suggests that ERMD’s role in nucleoid distribution 
is conserved, although the molecular identity of the ER-mitochondrial 
tether or tethers at division sites in mammalian cells is unknown. More 
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b, Molecular model for division site placement coupled to nucleoid segregation 
in yeast mitochondria. In yeast, the ER-mitochondria tethering complex, ER- 
mitochondrial encounter structure (ERMES), and the conserved Miro GTPase 
Gem] are spatially and functionally linked to ER-associated mitochondrial 
division (ERMD). ERMD sites marked by these components are also spatially 
localized to a subset of nucleoids that are actively replicating, and these segregate 
before ERMD. Gem1 acts as a negative regulator of ERMES-dependent 
ER-mitochondria contacts and is required for the spatial resolution of newly 
generated mitochondria by ERMD, possibly through the localized recruitment 
of the actin cytoskeleton. Cytoskeletal components may also participate in 
ER-associated mitochondrial constriction before DRP recruitment. Nucleoid 
placement at sites of ERMD could be mediated by a mark inside the organelle 
formed by the scaffold MitOS complex. Mdm33 is a possible candidate for the 
internal membrane scission machine in yeast. 
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importantly, the question of what determines the placement of divi- 
sion sites still needs to be answered. Specifically, it is not known what 
determinants are required for the finite number and spatial localization 
of ER-mitochondria contact sites linked to nucleoid segregation. These 
questions are related to whether, in a manner analogous to bacterial 
FtsZ, there is a machine inside mitochondria that facilitates mitochon- 
drial division. In yeast, an excellent candidate for the internal membrane 
scission machine is the inner membrane protein Mdm33, which pos- 
sesses matrix-localized coiled-coil regions that could act in trans across 
inner membranes to mediate constriction”. 

Given the endosymbiotic origin of mitochondria, it is tempting to 
speculate that a spatial mark inside the organelles is used as a division- 
plane placement determinant. Evidence exists for an autonomous struc- 
ture in mitochondria similar to the DNA-replicating replisome, which 
may function as such a mark. Specifically, nucleoid proteins required 
for mtDNA maintenance remain localized to discrete punctate struc- 
tures within mitochondrial tubules in the absence of mtDNA in yeast 
and mammalian cells, suggesting that they have an intrinsic ability to 
organize into a structure in a mtDNA-independent manner**™”. In yeast, 
replisomes segregate within mitochondria and maintain their associa- 
tion with ERMES-marked ER-mitochondria contact sites, even in the 
absence of mitochondrial genomes”. 

Mitochondrial skeletal structures may also serve as internal spatial 
marks. Although bacterial-like cytoskeletal elements have apparently 
been lost, many scaffold-like structures exist within mitochondria, 
facilitating the formation of their complex external and internal struc- 
ture. Mitochondrial scaffolds work together to create a higher-level 
organelle organization, which could encode spatial marks for nucleoid 
and/or division site placement. These scaffolds include the conserved 
prohibitin complex, which forms ring-like structures in the inner 
membrane that function together with mitochondrial lipids, such as 
cardiolipin and phosphatidylethanolamine, to organize inner mem- 
brane domains™. Another primary ‘skeletal’ element in mitochondria 
is the conserved multisubunit inner-membrane-associated complex 
MitOS (also called MICOS and MINOS)***’. Evidence indicates that 
MitOS forms an extended heteromorphic structure that organizes 
and potentially shapes the mitochondrial inner membrane, which 
is differentiated into regions that are structurally, compositionally 
and functionally distinct. The region in close apposition to the outer 
membrane, termed the boundary region, facilitates lipid trafficking, 
mitochondrial protein import and respiratory complex assembly. Inner 
membrane cisternal invaginations called cristae house assembled res- 
piratory complexes and have highly curved edges that are stabilized 
by the dimerization or multimerization of ATP synthase complexes*. 
Relatively narrow tubules, termed crista junctions, connect cristae to 
the boundary membrane and segregate soluble intermembrane-space 
components from the boundary regions. These junctions are restruc- 
tured in apoptosis to promote the release of intermembrane-space- 
localized cell-death mediators into the cytosol during mitochondrial 
outer-membrane permeabilization (MOMP)”. Super-resolution imag- 
ing has revealed that mammalian nucleoids are tightly associated with 
inner membrane cristae”. Thus, the MitOS complex may also have a 
direct role in nucleoid positioning and/or may be part of a spatial mark 
that links the inside of mitochondria to the outside. Consistent with 
this possibility, in yeast, elements of the MitOS complex seem adjacent 
to nucleoids, and loss of an intact MitOS complex leads to nucleoid 
aggregation®. This complex also facilitates mitochondrial biogenesis 
by interacting with components of the import and sorting machineries 
in the outer mitochondrial membrane®. Thus, MitOS may have a more 
global function as a ‘blueprint’ of mitochondrial organization and in 
this capacity could act as a form-to-function integrator. 


The ERMD microdomain as a systems integrator 

Although the fundamental role of ER-associated division may be to con- 
trol the distribution of mtDNA, evidence suggests that ERMD domains 
are harnessed for additional and diverse functions in cells and thus may 
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also function as integrators. In addition to ERMD, the ERMES complex 
has been functionally linked to the biogenesis of outer membrane pro- 
teins and to lipid transport between the ER and mitochondria, which 
is crucial for the synthesis of essential mitochondrial lipids, such as 
phosphatidylethanolamine and cardiolipin’. ERMD domains thus 
might also monitor cellular status by facilitating communication between 
mitochondrial behaviours and cellular signalling pathways, for example, 
between mitochondrial division, fusion and cell death. Consistent with 
this possibility is the fact that DRP1 facilitates the recruitment and activa- 
tion of the pro-apoptotic Bcl-2 protein BAX to the mitochondrial outer 
membrane to mediate MOMP™”. Furthermore, ER-synthesized sphin- 
golipids promote mitochondrial assembly of BAX and MOMP activation 
in vitro’. Conversely, mitochondrial fusion has a negative regulatory role 
in MOMP because cytosolic BAX promotes mitochondrial fusion in vitro 
through the DRP MEN2 (ref. 92), raising the possibility that fusion DRPs 
also regulate apoptosis through ERMD domains. It is possible that 
ERMD domains extend both into inner mitochondrial compartments 
and the ER lumen to integrate the functional status of both organelles 
as suggested by the regulation of MOMP by ER stress. Consistent with 
this, when ER stress occurs, the apoptotic regulator CDIP1 interacts with 
the ER protein BAP31, which subsequently leads to BAX assembly on 
mitochondria”. Thus, it will be interesting to determine whether sites of 
BAP31 and CDIP1 interaction coincide with sites of ERMD. The connec- 
tion between the ER and mitochondria is substantiated by their roles in 
a shared set of diseases associated with altered mitochondrial dynamics. 
For example, proteins localized to ER-mitochondrial contacts have been 
implicated in Huntington's disease, optic atrophy and spinocerebellar 
ataxias™; and alteration of ER-mitochondrial contact has been described 
in Alzheimer’s disease”*”’. Thus, ERMD domain dysfunction may be a 
contributory factor in many diseases. 

ERMD domains represent only one type of ER-mitochondrial con- 
tact. In yeast, for example, the ER is a component of two distinct Num1 
and Mmr] tethers that selectively position mitochondria at the cortex of 
mother and daughter cells, respectively, in a manner that is independent of 
ERMES and ERMD””® In addition, the fusion DRP MEN2, which is not 
essential to ERMD contact formation, has been proposed to act in mam- 
malian cells as an ER-mitochondrial tether™”’. More work is needed, 
especially in mammalian cells, to understand the molecular basis of ER- 
mitochondria contacts and whether specialized contacts and tethers exist 
for different shared ER-mitochondrial functions, such as Ca”* homeo- 
stasis, lipid biosynthesis, ERMD and for the ER-mitochondrial contacts 
implicated in autophagy’*”’””. This is an exciting area of mitochondrial 
biology that promises to be highly relevant to our understanding of the 
aetiology underlying diseases linked to mitochondrial dysfunction. 


Coordination of diverse mitochondrial behaviours 
Although mitochondrial division and fusion are major determinants of 
mitochondrial distribution, the behaviour of the mitochondrial network 
in cells is controlled by additional activities, such as tethering and motil- 
ity. Neuronal cells are a prominent example of how these behavioural 
networks must work together responsively to maintain cellular function. 
Neurons are long, excitable cells that are highly compartmentalized and, 
for proper function, mitochondria must be appropriately distributed to 
serve the cells’ different spatial and temporal demands. The demand for 
mitochondrial ATP production and Ca” buffering is especially high at 
axon terminals, which are dynamic structures that require the localized 
presence of mitochondria for synaptic transmission. Given that most 
mitochondrial biogenesis occurs in the soma of the neuron, active mecha- 
nisms are required to both transport and immobilize mitochondria at the 
distal synaptic terminals. 

Insight into how these two processes are coordinated and function 
together to selectively target mitochondria to active synaptic terminals 
has come from recent studies on the neuron-specific protein syntaphilin, 
which binds specifically to the mitochondrial outer membrane and accu- 
mulates on immobilized axonal mitochondria localized to active termi- 
nals’. Mitochondria destined for axons are generated by mitochondrial 
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division in the soma and transported to the synapse along microtubules. 
The spatial link between division and nucleoids, and nucleoids and 
cytoskeletal elements, might, in this cell type, ensure that mitochon- 
dria destined for transport contain mtDNA. Syntaphilin functions as 
a brake, using at least two separate mechanisms. It binds directly to the 
microtubule-based kinesin motor KIF5 in vitro and inhibits its motor 
activity, suggesting that it converts KIF5 into a component of a static 
microtubule-dependent mitochondrial tether. Whether the ER plays a 
part in the biogenesis of the syntaphilin-KIF5 tether is an outstanding 
question. Syntaphilin also competes for KIF5 binding with the adaptor 
Milton/TRAK to facilitate tethering. Thus, there is extensive interplay 
between the motility and tethering machines to control mitochondrial 
distribution in an activity dependent and spatially specific manner. 

Mitochondrial dynamics are probably also coordinated on a molecu- 
lar level with the transport and tethering machineries in different cell 
types. In mammalian cells, mitochondria that lack the fusion DRP 
MEN2 show severe motility defects’, similar to the motility defects 
observed in cells that lack Miro, and MFN2 has been reported to physi- 
cally interact with Miro and Milton/TRAK™. This link between mito- 
chondrial fusion and motility may be relevant for understanding why 
in humans, mutations in MFN2 and OPA1 cause the tissue-specific 
neurodegenerative diseases, Charcot Marie Tooth Type 2A (CMT2A) 
and dominant optic atrophy (DOA), respectively”. Indeed, a high fre- 
quency of mutations in mtDNA and nuclear genes that cause mitochon- 
drial dysfunction selectively affect neurons and cause a diverse set of 
neurodegenerative diseases”. 


Mitochondrial quality control pathways 

As well as being integrated with each other, mitochondrial behaviours 
are integrated with a battery of stress or quality-control pathways in cells 
that sense and respond to mitochondrial and cellular dysfunction (Fig. 3). 
Inside mitochondria, molecular chaperones and quality control proteases 
act together to promote the assembly of protein complexes comprised of 
mtDNA- and nucleus-encoded proteins as well as to monitor and degrade 
unfolded proteins”. An imbalance between nuclear and mitochondrial 
proteomes and/or an accumulation of unfolded mitochondrial proteins 
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Figure 3 | Integration of mitochondrial stress response pathways and their 
coordination with mitochondrial shape. Several different mitochondrial 
pathways respond to stress or damage and are coordinated with mitochondrial 
dynamics. Mitochondria in healthy cells generate an electrochemical potential 
that serves in oxidative phosphorylation and drives the import of proteins 

into the organelle. Damage that leads to a loss (blue) of mitochondrial 
membrane potential can lead to a loss of protein import efficiency. In the 
unfolded protein stress response (UPRmt) pathway, loss of import leads to 

the accumulation of the transcription factor ATFS1 in the nucleus, activating 

a transcriptional mitochondrial repair and metabolic adaptation response. 
Loss of membrane potential also triggers the OMA1-dependent proteolysis 

of long isoforms of the inner membrane fusion DRP OPA1, which attenuates 
mitochondrial fusion and potentially increases ER-mediated mitochondrial 
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triggers a transcriptional response program in metazoans, termed the 
mitochondrial unfolded protein stress response pathway (UPRmt)"°""’. 
The response is initiated by signals produced at the mitochondrial level 
that activate the transcription of nucleus-encoded mitochondrial chap- 
erone genes, as well as additional genes to restore organelle homeostasis. 
The pathway has been characterized at the molecular level in Caenorhab- 
ditis elegans, for which both the mitochondrial-inner-membrane peptide 
transporter HAF] and the bZip transcription factor ATFS1 are required 
for UPRmt signalling. Recent evidence indicates that in healthy cells, 
ATFS1 is actively imported into the mitochondrial matrix, where it is 
constitutively degraded in healthy mitochondria’®. Under conditions 
where the electron transport chain is disrupted, membrane potential is 
attenuated, and consequently the import efficiency of ATFS1 is decreased 
in amanner that is somehow dependent on HAF 1. Extra-mitochondrial 
ATFS1 is stabilized and targeted to the nucleus, where it initiates a tran- 
scriptional response that increases the expression of mitochondrial chap- 
erones and import machinery and remodels metabolism to rely less on 
respiration. Activation of the UPRmt in C. elegans is associated with an 
increase in lifespan, and recent evidence suggests that in mammals acti- 
vation of this pathway also contributes to longevity, further implicating 
mitochondria as a crucial factor in ageing’®*. It remains to be determined 
whether the molecular mechanisms underlying the UPRmt are conserved 
in mammalian systems. 

Additional mitochondrial stress-induced pathways are triggered by 
perturbations in electron transport chain function and/or reduction of 
membrane potential. The mitochondrial inner membrane fusion DRP 
OPA1 acts as a toggle between two pathways. In healthy cells, OPA1 pro- 
cessing occurs constitutively by the i-AAA protease YMEIL to generate 
long transmembrane anchored and short, soluble isoforms, which are 
required for membrane fusion'"'. Reduction of mitochondrial membrane 
potential causes the long OPA1 isoforms to be converted into short forms 
by the metalloprotease OMA1, resulting in the inhibition of mitochon- 
drial fusion and subsequent mitochondrial fragmentation’. This 
alteration serves to potentiate mitophagy and/or cell death. Conversely, 
long OPA1 isoforms are required for a different stress-induced response, 
termed mitochondrial hyperfusion. Mitochondrial hyperfusion mediates 
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division (ERMD), resulting in mitochondrial fragmentation. These 
fragmented mitochondria that have lost the ability to respire and import may 
also accumulate the PINK] kinase, which triggers mitophagy. In addition, 
under these conditions, the ERMD domain may be altered to directly 
promote BAX oligomerization on the mitochondrial outer membrane, 
outer-membrane permeabilization and cytochrome ¢ release, leading to 

cell death. Mitochondrial dysfunction and stresses such as starvation can 
trigger mitochondrial hyperfusion, which is dependent on maintenance 

of mitochondrial membrane potential (red) and the presence of both long 
and short OPA1 isoforms. Hyperfused mitochondria can transiently buffer 
the effects of respiratory chain dysfunction and do not enter the mitophagy 
pathway. The relationship between UPRmt and mitochondrial shape has not 
been explored. 
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the formation of a highly connected mitochondrial network and is thus 
thought to buffer the potentially deleterious effects of stresses, includ- 
ing those caused by ultraviolet irradiation and nutrient starvation. In the 
case of starvation, hyperfusion has been proposed to protect mitochon- 
dria from autophagic degradation or mitophagy through steric hinder- 
ance’'*"°. More recent work indicates that mitochondrial hyperfusion 
also serves as a homeostatic response to maintain ATP production in 
cases where complex IV of the electron transport chain is impaired”. 
The hyperfusion response, however, is transient and thus cannot buffer 
long-term defects in electron transport chain activity. A more terminal 
response to mitochondrial dysfunction is mitophagy, which is also trig- 
gered bya decrease in membrane-potential-driven protein import. In 
this pathway, the kinase PINK] is imported into healthy mitochondria 
and constitutively degraded. A decrease in import triggered by mito- 
chondrial dysfunction causes PINK] to accumulate on the outer mem- 
brane, where it recruits the E3 ligase Parkin''*'"”. Among Parkin’s targets 
for ubiquitination are the transport factor Miro and the mitochondrial 
fusion protein MFN2 (ref. 118-120). The proteasomal degradation of 
ubiquitinated mitochondrial outer membrane proteins is dependent on 
the AAA-ATPase p97 in a manner analogous to ER-associated degra- 
dation'*”""". The Parkin-dependent degradation of factors involved in 
mitochondrial motility and fusion enhances the selectivity of removing 
defective mitochondria by autophagy. In addition to specifying defective 
mitochondria for degradation, an in vivo study in Drosophila suggests 
that the PINK1-Parkin pathway may also be capable of selectively target- 
ing respiratory complexes for degradation’. In support of this idea, the 
selective targeting of complex I for degradation has also been described in 
cell culture models”, but the mechanisms underlying this phenomenon 
are currently unknown. 

Mitochondrial stress pathways probably have important roles in dis- 
ease manifestations of mitochondrial dysfunction and, as such, could 
highlight promising therapeutic targets. In the case of the PINK1-Parkin 
mitophagy pathway, mutations in each gene are linked to genetically 
inherited Parkinson’s disease’**"™, implying the pathway is relevant. 
However, most of the stress pathways described above, particularly those 
caused by a loss of membrane potential, have been characterized under 
conditions of acute extreme stress, raising the question of their physio- 
logical relevance’”’. In addition, despite these numerous stress pathways, 
mtDNA mutations are able to accumulate, particularly in differentiated 
post-mitotic cells, at the expense of functional wild-type mtDNA, leading 
to disease states. Thus, future work should focus on developing animal 
models that mimic diseases associated with mitochondrial dysfunction 
to assess the physiological contributions of these pathways. In addition, 
continued basic biological approaches are necessary to assess how the cell 
appropriately senses, and differentially activates and coordinates these 
pathways with each other and with other signalling pathways, such as 
those involved in cell death (Fig. 3). For example, it is unclear how cells 
appropriately integrate the UPRmt and mitophagy pathways, which are 
both regulated at the basic level of import. It is also unknown whether 
OPA 1 can function as a molecular integrator of stresses or simply as a 
modulator of mitochondrial shape or how OPA1-dependent stress path- 
ways are coordinated with UPRmt and mitophagy. 

The emerging picture of mitochondria is that of ‘super-organized’ 
structural domains for building an organelle whose behaviour is 
wired to be responsive to cellular needs, as well as its own dysfunc- 
tion. The combined use of system-based approaches, with super-res- 
olution microscopy and new genetic tools will allow us to understand 
the molecular basis of mitochondrial structure in detail. Exactly how 
mitochondrial super-organization is constructed will address the fun- 
damental question of whether primary determinants of organization 
originate from the inside of the organelle and are intimately tied to the 
most ancient feature — the genome. m 
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The multilayered complexity of 
ceRNA crosstalk and competition 


Yvonne Tay’, John Rinn*** & Pier Paolo Pandolfi' 


Recent reports have described an intricate interplay among diverse RNA species, including protein-coding messenger 
RNAs and non-coding RNAs such as long non-coding RNAs, pseudogenes and circular RNAs. These RNA transcripts 
act as competing endogenous RNAs (ceRNAs) or natural microRNA sponges — they communicate with and co-regulate 
each other by competing for binding to shared microRNAs, a family of small non-coding RNAs that are important post- 
transcriptional regulators of gene expression. Understanding this novel RNA crosstalk will lead to significant insight into 
gene regulatory networks and have implications in human development and disease. 


scription across 70-90% of the human genome. This was particu- 

larly surprising because less than 2% of the total genome encodes 
protein-coding genes, suggesting that non-coding RNAs represent most 
of the human transcriptome. Recent reports indicate that aside from 
around 21,000 protein-coding genes, the human transcriptome includes 
about 9,000 small RNAs, about 10,000-32,000 long non-coding RNAs 
(IncRNAs) and around 11,000 pseudogenes'*. Non-coding transcripts 
can generally be divided into two major classes on the basis of their size. 
Small non-coding RNAs have been relatively well characterized, and 
include transfer RNAs, which are involved in translation of messenger 
RNAs; microRNAs (miRNAs) and small-interfering RNAs, which are 
implicated in post-transcriptional RNA silencing; small nuclear RNAs, 
which are involved in splicing; small nucleolar RNAs, which are impli- 
cated in ribosomal RNA modification; PIW]-interacting RNAs, which 
are involved in transposon repression; and transcription initiation RNAs, 
promoter upstream transcripts and promoter-associated small RNAs, 
which may be involved in transcription regulation. IncRNAs can vary in 
length from 200 nucleotides to 100 kilobases, and have been implicated 
in a diverse range of biological processes from pluripotency to immune 
responses’. One of the best-studied and most dramatic examples is XIST, 
a single RNA gene that can recruit chromatin-modifying complexes to 
inactivate an entire chromosome during dosage compensation’. However, 
although thousands of IncRNAs have been identified in the past decade, 
only a small number have been functionally characterized. 

The importance of the non-coding transcriptome has become 
increasingly clear in recent years — comparative genomic analysis has 
demonstrated a significant difference in genome utilization among 
species (for example, the protein-coding genome constitutes almost 
the entire genome of unicellular yeast, but only 2% of mammalian 
genomes)’, and the non-coding transcriptome is often dysregulated 
in cancer®. These observations suggest that the non-coding transcrip- 
tome is of crucial importance in determining the greater complexity of 
higher eukaryotes and in disease pathogenesis’*. Functionalizing the 
non-coding space will undoubtedly lead to important insight about 
basic physiology and disease progression. 

Although many reviews have focused on the regulatory mechanisms 
and functions of IncRNAs*”, there are many additional implications 
for the pervasive transcription observed in mammalian genomes. In 


|: recent years, numerous studies have documented pervasive tran- 


this Review, we focus on the emerging roles of RNA-RNA crosstalk, 
which include new layers of gene regulation that involve interactions 
between diverse RNA species. A classic example of RNA-RNA interac- 
tions involves the post-transcriptional regulation of RNA transcripts by 
miRNAs. As our knowledge of the transcriptome space has expanded, 
it has become increasingly clear that numerous miRNA-binding sites 
exist on a wide variety of RNA transcripts, leading to the hypothesis that 
all RNA transcripts that contain miRNA-binding sites can communi- 
cate with and regulate each other by competing specifically for shared 
miRNAs, thus acting as competing endogenous RNAs (ceRNAs)'°". 
miRNA competition thus extends beyond the non-coding transcrip- 
tome and potentially confers an additional non-protein-coding function 
to protein-coding mRNAs. Although it has been proposed that cCRNA 
crosstalk may be limited to a small subset of transcripts, owing to fac- 
tors such as miRNA abundance, ceRNA abundance and subcellular 
localization’””’, the discovery of functional ceRNA regulation in diverse 
species — including viruses, plants, mice and humans — by multiple 
independent groups suggests that it may represent a widespread layer 
of gene regulation’*"'*. We discuss literature describing the effect of 
miRNA competition on the regulation of both non-coding and coding 
RNAs, additional factors that may affect ceRNA activity and potential 
directions for future studies, as well as the implications of miRNA com- 
petition for development and disease. 


RNA crosstalk in competitive endogenous networks 
Competition between various molecular species to bind to a specific 
molecular target has been described in multiple contexts, and includes 
DNA-protein, RNA-protein, RNA-RNA and protein-protein crosstalk. 
This Review focuses on crosstalk involving RNA-RNA interactions and 
an additional regulatory layer of RNA-protein interactions. 


Protein-RNA competition 

RNA-binding proteins (RBPs) are key regulators of multiple post- 
transcriptional events, including RNA splicing, stability, transport 
and translation’. Various RBPs may compete to bind to specific 
target transcripts: the RBP HuR, which generally stabilizes target 
transcripts; and AUF1, which generally leads to the rapid degrada- 
tion of target transcripts, have been shown to compete for common 
target binding sites"; and competition between HuR and wild-type 
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TTP for binding to the HuR (also known as Elavl1) transcript has 
been implicated in HuR regulation and cytoplasmic localization”. 
Conversely, target transcripts may compete to bind to specific RBPs: 
competition between GAP43 and $-actin mRNAs for binding to the 
K-homology (KH)-domain RBP ZBP1 has been shown to affect their 
exonal localization”. 


miRNA-RNA competition 

In addition to the protein-coding dimension, molecular competition 
also extends to regulatory networks comprised exclusively of RNAs, 
suggesting that sequence competition represents a universal and preva- 
lent form of gene regulation. Two intriguing new players, which add 
further complexity to RNA crosstalk, are miRNAs and IncRNAs. Exper- 
imental evidence has confirmed that competition for miRNAs, small 
non-coding regulators of gene expression, plays an integral part in the 
regulation of both IncRNAs'*'°** and mRNAs'”"**"”, We summarize 
the background and experimental evidence that supports this competi- 
tive RNA crosstalk and discuss potential refinements and directions for 
future analyses. 


Artificial miRNA sponges as miRNA competitors 
Several years before the discovery of naturally occurring miRNA 
sponges, or ceRNAs, various groups described the use of artificial 
miRNA sponges as effective miRNA inhibitors ~*°. These sponges are 
usually expressed from strong promoters, contain multiple binding sites 
for an miRNA of interest and have been shown to derepress miRNA 
targets at least as effectively as chemically modified antisense oligonu- 
cleotides*’. The efficacy of artificial sponges has been demonstrated for 
multiple miRNAs both in vitro and in vivo". Artificial miRNA sponges 
constructed with multiple binding sites for different miRNAs may also be 
used to study the effect of several miRNAs simultaneously”. 
Intriguingly, although sponges with perfectly complementary 
miRNA-binding sites have been shown to be effective”**”, ‘bulged 
sponges, which include a central bulge and hence bind miRNAs with 
imperfect complementarity, have been demonstrated to sequester 
miRNAs with greater efficacy*’******. This may be partly due to the 
fact that, unlike perfectly complementary targets, imperfect targets are 
not immediately degraded and are thus able to reduce miRNA bio- 
availability until the mRNA is destabilized by other factors*’**. These 
artificial miRNA sponges will not only be invaluable tools for miRNA 
loss-of-function studies in vitro and in vivo, but may also provide a new 
platform for RNA-based therapeutic applications. 


Table 1 | List of validated non-coding competing endogenous RNAs 


REVIEW 


Natural miRNA sponges as ceRNAs 

It has been proposed that naturally occurring protein-coding and 
non-coding RNA transcripts can act as endogenous miRNA sponges, 
or ceRNAs'?>. ceRNAs communicate with, and co-regulate, each 
other by competing to bind to shared miRNAs, thereby titrating 
miRNA availability. Because ceRNA crosstalk can be deciphered bio- 
informatically and experimentally, the identification and analysis of 
ceRNA interactions may allow the systematic functionalization of the 
non-coding transcriptome, as well as a non-protein-coding function 
to be attributed to mRNA transcripts, which may be complementary 
or even distinct from their protein-coding function”. 


Non-coding RNAs as ceRNAs 

Increasing experimental evidence supports the hypothesis that mul- 
tiple non-coding RNA species, including small non-coding RNAs, 
pseudogenes, IncRNAs and circular RNAs (circRNAs) may pos- 
sess ceRNA activity. 


Plant and viral non-coding ceRNAs 

One of the first examples of endogenous non-coding miRNA sponges 
was described in plants'* (Table 1). The non-coding RNA IPS1 from 
Arabidopsis thaliana has been reported to alter the stability of PHO2 
mRNA by sequestering the phosphate starvation-induced miRNA 
miR-399. In addition, although most miRNA targets in plants are 
cleaved owing to almost perfect miRNA complementarity, the IPS1 
motif contained a mismatched loop at the miRNA cleavage site that 
abolished transcript cleavage and resulted in effective miR-399 seques- 
tration. Generation of a cleavable JPS1 variant eliminated its inhibitory 
activity on miR-399. This was consistent with the observations from 
artificial sponge constructs suggesting that imperfectly complemen- 
tary ‘bulged sponges’ sequester miRNAs more effectively than perfectly 
complementary miRNA sponges. 

Intriguingly, the primate virus Herpesvirus saimiri has been shown 
to use a non-coding ceRNA to control host-cell gene expression”. 
H. saimiri-transformed T cells express several viral non-coding 
RNAs of unknown function called H. saimiri U-rich RNAs (HSURs). 
One of these non-coding RNAs, HSURI, has been found to contain 
miR-27-binding sites and direct miR-27 degradation in a sequence- 
specific and binding-dependent manner. The expression of HSUR1 and 
HSUR2 was shown to correlate with an upregulation of FOXO1 levels, a 
validated miR-27 target, suggesting that perturbation of miRNA expres- 
sion by HSURs was able to control host gene expression. 


Non-coding RNA species Competing endogenous RNAs Shared microRNAs Organism Reference 
Small non-coding RNA HSUR1 FOXO1 miR-27a Herpesvirus saimiri 14 
Long non-coding RNA IPS1 PHO2 miR-399 Arabidopsis thaliana 15 
HULC PRKACB miR-372 Homo sapiens 25 
linc-MD1 MAML1 miR-133 Mus musculus and Homo sapiens 24 
MEF2C miR-135 
linc-RoR NANOG miR-145 Homo sapiens 43 
OCT4 
SOX2 
PTCSCS3 miR-574-5p Homo sapiens 41 
H19 Let-7 family Mus musculus and Homo sapiens 42 
Pseudogene PTENP1 PTEN miR-17, miR-19, miR-21, miR-26 and Homo sapiens 16 
miR-214 families 
KRAS1P KRAS Let-7 family 
Pbcas4 BCAS4 miR-185 Mus musculus and Homo sapiens 39 
Circular RNA CDR1as/ciRS-7 miR-7 Danio rerio, Mus musculus and Homo sapiens 45, 46 
Sry miR-138 Mus musculus and Homo sapiens 45 
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Figure 1 | PTEN competing endogenous RNA (ceRNA) network. Validated 
interactions (grey arrows) between ceRNAs have been reported, shown 
here with the respective validated microRNAs (miRNAs). Studies have also 


Pseudogene ceRNAs 

In humans, the non-coding PTENP1 pseudogene has been reported 
to regulate levels of its cognate gene, the tumour suppressor PTEN, by 
competing for shared miRNAs’ (Fig. 1). PTENP1 has been shown to 
act as a decoy for PTEN-targeting miRNAs, to possess tumour suppres- 
sive activity and to be selectively lost in human cancers. An additional 
dimension to PTENP1-mediated ceRNA regulation was recently out- 
lined*’, whereby two antisense RNAs encoded by the PTENP1 locus a 
and B were characterized. The 6 isoform was shown to interact with the 
PTENP1 transcript through an RNA-RNA pairing interaction, affecting 
its stability and ceRNA activity”. 

In addition, overexpression of the KRAS pseudogene KRASIP 
3’ untranslated region (UTR) has been reported to increase KRAS tran- 
script abundance and accelerate cell proliferation. The KRAS1P locus 
was amplified in various human cancers, and KRAS1P transcript levels 
positively correlated with KRAS transcript levels in prostate cancer. 
Taken together, these findings were particularly relevant because they 
attributed a new function to transcribed pseudogenes, IncRNAs, which 
have been considered to be biologically inconsequential owing to their 
inability to be translated into functional proteins. Although transcribed 
pseudogenes may be expressed at much lower levels than their cognate 
genes, this is counterbalanced by their high degree of shared sequence 
homology, which results in the conservation of multiple miRNA-bind- 
ing sites and allows them to compete for the binding of many shared 
miRNAs simultaneously. Furthermore, it has been suggested that RNA 
transcripts that contain premature stop codons, such as pseudogenes, 
may be subjected to nonsense-mediated mRNA decay”. This rapid 
turnover may conceivably lead to their low abundance, as well as the 
increased degradation of bound miRNAs, enhancing their effectiveness 
as miRNA sponges. 

Another example of a pseudogene ceRNA was described by 
Marques et al., who focused their attention on ‘unitary pseudogenes, 
which retain protein-coding capability in the human lineage but lose 
protein-coding function in the rodent lineage. This allowed the RNA- 
mediated function of RNAs to be disentangled from their ancestral 
protein-coding function. The mouse Phcas4 transcript was validated 
as a conserved ceRNA for human BCAS4, which downregulated expres- 
sion levels of the protein-coding transcripts Bcl2, I117rd, Pnpla3, Shisa7 
and Tapbp by acting as a miR-185 decoy”. The preservation of miRNA 
response elements (MREs) and thus miRNA sponge function of this 
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identified potential ceRNA interactions (blue arrows) based on the respective 
validated miRNAs. Validated PTEN ceRNAs described in ref. 26 are not 
included because the specific miRNAs were not identified. 


unitary pseudogene after the loss of its protein-coding function lends 
support to the hypothesis that ceRNA interactions represent a con- 
served and biologically relevant mechanism of post-transcriptional 
gene regulation, which confers an additional non-protein-coding role 
to protein-coding transcripts. 


IncRNA ceRNAs 

The complexity and diversity of potential ceRNA interactions have 
scaled exponentially with the identification of more than 10,000 
IncRNAs. Examples are already emerging of IncRNAs as competitive 
platforms for both miRNAs and mRNAs. HuR-mediated binding of the 
miRNA let-7 to the long intergenic non-coding RNA lincRNA-p21 was 
shown to lead to decreased lincRNA-p21 transcipt levels and a concomi- 
tant increase in the translation of the lincRNA-p21-associated mRNAs 
JUNB and CTNNBI (ref. 40). 

Many IncRNAs are known to have low abundance and/or nuclear 
localization’, which may adversely affect their effectiveness as ceRNAs 
in steady state conditions. However, it has been shown that thousands 
of IncRNAs possess cell type-, tissue type-, developmental stage- and 
disease-specific expression patterns and localization’, suggesting that 
individual IncRNAs may be potent natural miRNA sponges in certain 
settings. Two such examples of IncRNA ceRNAs that have emerged, 
underscoring the importance of these miRNA-IncRNA competitive 
interactions, are HULC and PTCSC3. 

The IncRNA HULC has been identified as one of the most signifi- 
cantly upregulated transcripts in hepatocellular carcinoma (HCC)”. 
The HULC transcript was found to contain miR-372-binding sites, 
and its overexpression could reduce miR-372 expression and activ- 
ity in the liver cancer cell line Hep3B. HULC’s ceRNA activity forms 
part of an intricate autoregulatory loop: miR-372 inhibition by HULC 
reduces translational repression of its target transcript PRKACB, induc- 
ing cAMP response element binding protein (CREB) phosphorylation 
and enhancing CREB-dependent HULC upregulation in liver cancer. 
Significantly, this observed interaction extended beyond in vitro cell- 
line data to human clinical samples: miR-372 expression was downregu- 
lated in all HCC tissue samples compared with corresponding adjacent 
normal tissue, consistent with increased HULC expression. 

In contrast to HULC, which is upregulated in HCC, PTCSC3 was 
found to be dramatically downregulated in thyroid cancers. PTCSC3 
transfection has been shown to lead to a significant decrease in 
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expression levels of the oncogenic miR-574-5p, as well as growth inhibi- 
tion, cell-cycle arrest and increased apoptosis in three thyroid cancer cell 
lines*’. These two studies suggest that both the disease-specific upregu- 
lation and downregulation of individual IncRNAs, and the resultant 
changes in ceRNA-mediated interactions, may have profound effects 
in pathophysiological conditions. 

In addition to their roles in various cancers, ncRNA ceRNAs have 
been implicated in human development. For example, a muscle-specific 
IncRNA, lincMD1, which is activated on myoblast differentiation and con- 
trols muscle differentiation in human and mouse myoblasts through its 
ceRNA activity, has been identified”. lincMD1 sequestered miR-133 and 
miR-135, effectively regulating the expression of MAML1 and MEF2C 
mRNAs, respectively. Importantly, when lincMD1 levels were in excess, 
the repression of MAML1 and MEF2C could be titrated by increasing 
expression levels of the respective miRNAs, confirming that the observed 
regulation was due to direct competition for those specific miRNAs. As 
MAMLI and MEF2C encode transcription factors known to activate 
muscle-specific gene expression, these data suggest that ceRNA regula- 
tion is of crucial importance in myogenic differentiation. In another study, 
developmentally regulated IncRNA H19 was found to contain binding 
sites for the let-7 miRNA family, and thus acted as an effective ceRNA 
for the very abundant let-7 miRNA, hence modulating the expression of 
other let-7 target transcripts including Dicer and Hmga? (ref. 42). The 
observation that let-7 overexpression could recapitulate the precocious 
muscle differentiation caused by H19 knockdown provides further evi- 
dence of the physiological relevance of ceRNA interactions in this setting. 

Sequestration of miRNAs has also been shown to be of functional 
importance in pluripotent embryonic stem (ES) cells. The IncRNA 
linc-RoR, which is abundantly expressed in human ES cells and down- 
regulated during differentiation, has been shown to share regulatory 
miRNAs with OCT4, SOX2 and NANOG, which are core transcription 
factors that are essential for ES cell self-renewal”. linc-RoR effectively 
sequestered miR-145, protecting OCT4, SOX2 and NANOG transcripts 
from miR-145-mediated suppression. Significantly, this regulation was 
abolished by the introduction of mutations in the two miR-145-binding 
sites in linc-RoR, providing further evidence that the observed effect was 
miR-145 dependent. As linc-RoR transcription is directly regulated by 
these core transcription factors, these results implicate it in a regulatory 
feedback loop in ES cells and suggest that its ceRNA function is essential 
for ES cell pluripotency and self-renewal. 


circRNA ceRNAs 

About 20 years ago, the predominant transcript of the testis-determin- 
ing gene Sry in mouse testis was found to be circular™, although the 
physiological relevance of these RNA circles remained elusive. The 
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recent discovery of competitive RNA-RNA interactions coupled with 
the extensive complementarity of circRNAs to their linear mRNA coun- 
terparts has raised the possibility that these RNA circles may have an 
integral role in regulatory RNA networks. Recently, a circRNA called 
CDR1as (also known as ciRS-7)*“° was identified. This highly stable 
circRNA contains more than 60 conserved binding sites for miR-7 and 
hence acts as an effective miR-7 sponge that affects miR-7 target gene 
activity. In zebrafish, its expression impaired midbrain development 
in a manner analogous to miR-7 knockdown*. However, this is not an 
isolated example of circRNAs with ceRNA activity, Sry has also been 
validated as a miR-138 sponge”. These studies represent the first func- 
tional analysis of circRNAs. 

Recent bioinformatic and experimental analyses have identified thou- 
sands of circRNAs in the mammalian transcriptome, suggesting that 
circRNAs may in fact represent a new class of ceRNA regulators*”. 
Importantly, owing to their high expression levels and increased sta- 
bility, circRNAs with ceRNAs activity may be exceptionally effective 
modulators of the crosstalk between linear ceRNAs™. circCRNAs such 
as CDR1as that contain multiple binding sites for the same miRNA 
may represent a mechanism for sequestering more abundant miRNAs, 
which would be significantly less susceptible to titration by transcripts 
that contain only one miRNA-binding site. In addition to their ceRNA 
function, it is possible that circRNAs may also bind and sequester RBPs, 
base pair with other RNAs or even produce proteins”. 

Taken together, these studies suggest that the analysis of cRNA cross- 
talk may provide invaluable insight into the function of diverse species 
of non-coding RNAs, including IncRNAs, pseudogenes and circRNAs. 
Furthermore, because any RNA transcript that contains miRNA-binding 
sites can sequester miRNAs, ceRNA crosstalk also extends beyond the 
non-coding space to confer a non-protein-coding function to mRNAs. 


mRNAs as ceRNAs 
The gene with perhaps the most extensively characterized cRNA 
network is the important tumour suppressor PTEN. Aside from the 
non-coding pseudogene PTENPI (discussed earlier), the PTEN cRNA 
network includes multiple protein-coding transcripts (Fig. 1, Table 2). 
A combined computational and experimental approach has been 
used to identify CNOT6L and VAPA as protein-coding transcripts that 
regulate PTEN transcript and protein expression in a Dicer-dependent 
manner, antagonize downstream PI(3)K signalling and possess growth- 
and tumour-suppressive properties’’. These genes were also coex- 
pressed with PTEN mRNA in several human cancers and displayed copy 
number loss in colon cancer. The study demonstrated that previously 
uncharacterized transcripts could be functionalized, partly through the 
identification of their ceRNA interactors, and presented a framework 


Competing endogenous RNAs Shared microRNAs Organism Reference 
PTEN  CNOT6L miR-17 and miR-19 families Homo sapiens 17 
VAPA miR-17, miR-19 and miR-26 families 
ZEB2 miR-25, miR-92a, miR-181 and miR-200b Homo sapiens and Mus musculus 18 
ABHD13, CCDC6, CTBP2, DCLK1, DKK1, HIAT1, Homo sapiens 26 
HIF1A, KLF6, LRCH1, NRAS, RB1, TAF5 and TNKS2 
PTEN  VCAN miR-136 and miR-144 Homo sapiens and Mus musculus 50 
Rb1 VCAN miR-144 and miR-199a-3p 
CD34 miR-133a, miR-144 and miR-431 51 
FN1 miR-133a, miR-199a-3p and miR-431 27,51 
FN1 CD44 miR-491, miR-512-3p and miR-671 Homo sapiens 53 
Collal CD44 miR-328 
CDC42 miR-216a, miR-330 and miR-608 52 
HMGA2_ TGFBR3 Let-7 family Homo sapiens and Mus musculus 54 
m169° m169 miR-27a and miR-27b Murine cytomegalovirus 55, 56 
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for the prediction and validation of ceRNA interactions that is widely 
applicable to any potential transcript of interest. 

One of the first examples linking aberrant ceRNA expression to 
tumorigenesis in vivo came from the discovery of a significant enrich- 
ment of predicted PTEN ceRNAs among genes whose loss accelerated 
melanomagenesis in a Sleeping Beauty insertional mutagenesis screen 
in vivo. ZEB2 was subsequently validated as a bona fide PTEN ceRNA 
that modulated PTEN protein levels in an miRNA-dependent, protein- 
coding-independent fashion’*. Decreased ZEB2 transcript levels acti- 
vated downstream signalling, enhanced cell transformation and were 
found to occur frequently in human melanoma and other cancers with 
low PTEN mRNA expression. These data suggest that the dysregulation 
of PTEN expression owing to the loss of its ceRNA ZEB2 contributes to 
the development of melanoma both in vitro and in vivo. 

Analysis of gene expression data in glioblastoma in combination 
with matched miRNA profiles validated 13 PTEN ceRNAs or miR 
program-mediated post-transcriptional regulatory (mPR) regulators 
whose locus deletions were predictive of decreased PTEN expression, 
downregulated PTEN in a 3’ UTR-dependent manner and increased 
tumour cell growth rates**. When the analysis was significantly 
extended beyond the binary ceRNA associations described in most 
other studies, the PTEN ceRNA interactions were found to be part of 
a post-transcriptional regulatory layer comprising more than 248,000 
miRNA-mediated interactions. 

The VCAN 3’ UTR has been reported to modulate PTEN levels by 
sequestering the shared miRNAs miR-144 and miR-136, freeing PTEN 
mRNA for translation®’. In addition to its role as a PTEN ceRNA, 
VCAN has been shown to modulate the expression of several other 
genes through miRNA competition (Fig. 1). VCAN was also demon- 
strated to act as a ceRNA for the cell-cycle regulator RB1 by regulat- 
ing miR-199a-3p and miR-144 levels, upregulating the expression of 
this crucial tumour suppressor both in vitro and in vivo”. CD34 and 
FNI1 were validated as two additional VCAN ceRNAs that commu- 
nicate through interactions with miR-133a, miR-199a-3p, miR-144 
and miR-431 (refs 27, 51). As overexpression of the VCAN 3’ UTR in 
vivo was shown to induce organ adhesion followed by hepatocellular 
carcinoma development at a later time point, these studies were the 
first to demonstrate that the perturbation of a non-coding RNA tran- 
script with validated ceRNA function has physiological and patho- 
physiological relevance in vivo. 

Another gene with validated protein-coding ceRNAs is CD44, 
which encodes a transmembrane glycoprotein that is involved in 
a wide range of cellular functions”. The CD44 3’ UTR was shown 
to inhibit cell proliferation, colony formation, tumour growth and 
enhance cell motility, invasion and adhesion™”*. CD44 regulated levels 
of CDC42, a Rho-GTPase involved in the control of cell morphology, 
migration and cell-cycle progression, by binding and sequestering 
three miRNAs, miR-216a, miR-330 and miR-608. It was also shown to 
modulate levels of Colla1 through miR-328 binding and FN1 through 
miR-512-3p, miR-491 and miR-671 binding. The ceRNA interactions 
between CD44 and FN1 link CD44, CDC42 and Colla] to the broader 
miRNA-ceRNA network revolving around PTEN and VCAN. This 
adds another level of complexity to these post-transciptional regula- 
tory associations between multiple transcripts, which play important 
parts in tumorigenesis (Fig. 1). 

A recent study outlined the role of Hmga2’s ceRNA function in 
promoting lung carcinogenesis”. Hmga2, a non-histone chromosomal 
high mobility group protein, was shown to be highly expressed in 
metastatic lung adenocarcinoma and to promote lung cancer progres- 
sion by sequestering the abundant let-7 family of miRNAs. The TGF-B 
co-receptor Tgfbr3 was identified as a putative Hmga2 ceRNA, and 
Tgfbr3-driven TGF-6 signalling was demonstrated to be largely nec- 
essary for Hmga2-mediated lung-cancer progression. The discovery 
that the primary function ofa protein-coding transcript is to act as 
an oncogenic ceRNA for a very abundant miRNA, largely indepen- 
dently of its protein-coding function, provides further support for 
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the hypothesis that the ceRNA function of multiple protein-coding 
transcripts is of fundamental importance in cancer progression. 

A viral mRNA that functions as a natural miRNA sponge in host 
cells has also been described. Intriguingly, this mRNA was found to 
sequester the same miRNA reported to be downregulated by HSUR1- 
binding in H. saimiri, suggesting that ceRNA-mediated regulation of 
this miRNA may be a conserved mechanism for controlling host gene 
expression in multiple viruses. Two studies independently showed that 
the highly abundant murine cytomegalovirus mRNA m 169 acted as 
a natural miRNA sponge in host cells by binding to miR-27 through 
a single site in its 3’ UTR, hence directing its degradation®**. Impor- 
tantly, m169-mediated miR-27 regulation is another example of the 
importance of ceRNA function in vivo: disruption or replacement of 
the miRNA target site resulted in significant viral attenuation in multiple 
organs, suggesting that this miRNA-sponge interaction is crucial for 
efficient replication in vivo. 


Predicting ceRNA crosstalk 

ceRNA crosstalk depends on the MREs located on each transcript, 
which combinatorially form the foundation of these co-regulatory 
interactions’. Prediction of ceRNA crosstalk is thus dependent on the 
identification of MREs on the relevant transcripts of interest. Several 
miRNA-target prediction algorithms, including TargetScan, miRanda, 
rna22 and PITA, have been successfully used to identify ceRNA interac- 
tions. However, miRNA target identification is challenging owing to the 
imperfect nature of base pairing between an miRNA and its target, and 
the rules of targeting are not completely understood”. Further develop- 
ment and optimization of these algorithms will undoubtedly improve 
subsequent predictions of ceRNA interactions. 

A database of predicted ceRNA interactions (ceRDB), which was gen- 
erated by examining the co-occurrence of MREs on a genome-wide 
basis™, has recently been compiled. Predicted miRNA-mRNA target 
interactions were obtained from TargetScan release 5.2, and interac- 
tion scores were defined for each mRNA by adding the total number 
of MREs that overlap with the miRNAs for the mRNA of interest; 
these interaction scores were then used to rank the predicted potential 
ceRNAs. Although this analysis is limited to the 3’ UTRs of protein- 
coding transcripts, it still represents a useful tool for the identification 
of putative ceRNA crosstalk. 

As an alternative to in silico prediction strategies, recently developed 
high-throughput biochemical techniques, which identify endogenous 
miRNA-target interactions can be used. Examples of these experimen- 
tal methods include high-throughput sequencing of RNA isolated by 
crosslinking immunoprecipitation (HITS-CLIP)” and photoactivata- 
ble-ribonucleoside-enhanced crosslinking and immunoprecipitation 
(PAR-CLIP)”. HITS-CLIP analysis of argonaute-bound miRNA- 
mRNA complexes has generated genome-wide interaction maps for 
specific miRNAs". Recently, integration of HITS-CLIP data was shown 
to enhance miRNA target predictions by more than 20-fold over com- 
putational approaches alone, and mRNA-miRNA predictions have 
been used to identify candidate ceRNA interactions in the regenerat- 
ing liver”. PAR-CLIP is a modification of the HITS-CLIP methodol- 
ogy with improved RNA recovery, which is able to indicate the exact 
targeting site more precisely. A complementary approach to these is 
MS2-tagged RNA affinity purification (MS2-TRAP), which can be used 
to identify all miRNAs associated with a target transcript in a particular 
cellular context®’. Harnessing these experimental techniques will pro- 
vide further insight into ceRNA regulation beyond that which is possible 
with in silico target predictions. 


Additional considerations for ceRNA activity 

Although multiple examples of ceRNA interactions have been 
described, little is known about the molecular conditions necessary for 
optimal ceRNA activity. Considerations such as the abundance of key 
players, potential interplay with RBPs and RNA editing may have pro- 
found effects on ceRNA crosstalk. 
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Figure 2 | Variable factors that may influence competing endogenous 
RNA (ceRNA) effectiveness. a, Steady state levels of ceRNA1 (protein- 
coding region shown in red) and ceRNA2 (protein-coding region 
shown in blue). Active repression by miRNAs is shown in light grey. b, If 
expression of ceRNA2 increases, this will increase expression of ceRNA1. 
c, ceRNA3 has a different subcellular localization from ceRNA1, and may 


Abundance of miRNAs, ceRNAs and argonaute 

Various factors, including miRNA- and ceRNA-expression levels and 
subcellular localization, the number of shared miRNAs and MREs, as 
well as binding affinity of the shared MREs have been suggested to con- 
tribute to ceRNA effectiveness’”’” (Fig. 2). For example, it has been 
hypothesized that miRNA-RNA competition would apply only to a 
small subset of miRNAs whose abundance and corresponding target 
abundance fell within a narrow range, as the expression of competi- 
tor RNAs would have little impact on regulation by highly abundant 
miRNAs, and low abundance miRNAs would be unlikely to contribute 
much to gene regulation because a minimal number of targets would 
be bound at any given time”’. 

Recent studies have further refined the dynamics and constraints of 
ceRNA crosstalk. Optimal conditions for ceRNA activity in silico have 
been determined using a mathematical mass-action model, and these 
were confirmed experimentally using PTEN and its validated ceRNA 
VAPA®. In addition, a minimal rate equation-based model to describe 
crosstalk between ceRNAS at steady state has been developed”. The 
relative abundance of both ceRNAs and miRNAs, their stoichiometry, 
the number of shared MREs and indirect interactions were shown to 
be crucial determinants of ceRNA cross-regulation. For instance, both 
studies found that ceRNA crosstalk was optimal when the transcript 
abundance of miRNAs and ceRNAs within a network were near equi- 
molarity. ceRNA crosstalk would be minimal when ceRNA transcript 
abundance vastly exceeded that of miRNAs owing to the limited num- 
ber of miRNAs available to repress their targets, as well as when miRNA 
transcript abundance vastly exceeded that of ceRNAs owing to maximal 
repression of most target transcripts. 

Strong correlations in network connectivity that enhance ceRNA 
crosstalk have also been observed”: ceRNA interactions may be medi- 
ated by a large number of miRNA species that individually have weak 
effects on their respective targets; and ceRNA regulation may be sym- 
metrical (two ceRNAs reciprocally regulate each other) or asymmetri- 
cal (one ceRNA regulates another but not vice versa). Intriguingly, a 
separate study® found that ceRNA and transcription factor networks 
were intricately intertwined and, with optimal molecular conditions, 
changes in the levels of a single ceRNA could affect integrated ceRNA 
and transcriptional networks. 

Another potential rate-limiting determinant of ceRNA crosstalk is the 
abundance of argonaute, the catalytic component of the RNA-induced 
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bea less effective ceRNA than ceRNA2. d, Increased expression of the 
shared miRNA will increase repression of both ceRNA1 and ceRNA2. e, 
ceRNA4 contains miRNA response elements (MREs, purple and orange) 
for multiple shared miRNAs, and may be a more effective ceRNA than 
ceRNA2. f, ceRNA5 contains more MREs than ceRNA2 and may be a more 
effective ceRNA. 


silencing complex (RISC). A mathematical model constructed to ana- 
lyse changes in gene expression caused by competition between small 
RNAs, showed that competition for argonaute binding occurs within an 
intermediate range of argonaute abundance, whereby lower amounts of 
argonaute result in stronger competition”. Surprisingly, this model also 
revealed that when argonaute is highly abundant, the overexpression of 
one miRNA decreases expression levels of its own targets as well as the 
targets of other miRNAs. This is a result of the overexpressed miRNA 
binding to targets that share MREs for that miRNA and other miRNAs, 
freeing the other miRNAs to regulate other target genes. 


Interplay between miRNAs and RBPs 

As already discussed, in addition to competing for shared miRNAs, 
RNA transcripts may also compete for shared RBPs. These two regula- 
tory functions are not always distinct and may in fact be intricately inter- 
twined. RBP and miRNA crosstalk may be either direct, whereby they 
compete or cooperate for binding to a specific binding site, or indirect, 
whereby RBP binding changes the RNA secondary structure and thus 
alters accessibility to miRNA-binding sites. For example, competition 
between the stabilizing RBP HuR and miRNAs usually enhances gene 
expression if the HuR-mRNA association is predominant, and coopera- 
tion between HuR and miRNAs typically reduces expression of the tar- 
get mRNA“. HuR has been shown to compete with miR-122, miR-548c, 
miR-494, miR-16 and miR-331-3p for binding to the CAT1, TOP2A, 
NCL, COX2 and ERBB2 mRNAs, respectively® ’’. Conversely, HuR has 
been demonstrated to recruit the let-7 RISC and miR-19 RISC to repress 
the expression of c-Myc and RhoB transcripts, respectively’*”. Other 
examples of miRNA and RBP crosstalk include a study that reported 
that the germ-cell-specific RBP Dnd1 bound mRNAs and blocked 
miRNA access to their target sites’; and another that demonstrated 
that competition between miR-4661 and TTP for binding to the ARE 
sequence in IL-10 mRNA protected IL-10 from TTP-mediated degrada- 
tion’’. The exact RBP species present in a specific biological setting and 
their relative abundance may thus have a profound impact on miRNA 
and ceRNA regulatory networks. 


RNA editing 

RNA editing refers to a post-transcriptional processing mechanism 
producing an RNA sequence that is different from its template DNA”. 
The most prevalent type of RNA editing in higher eukaryotes is the 
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hydrolytic deamination of adenosine to inosine in double-stranded 
RNA substrates (A to I RNA editing). As inosine has the same base- 
pairing properties as guanosine, it pairs preferentially with cytidine 
instead of uridine. This effectively alters the sequence and base-pairing 
properties of the edited RNA, and may affect ceRNA regulation in two 
ways. First, editing of miRNA transcripts may repress biogenesis, affect 
strand stability or even alter their target spectrum” second, editing 
of target RNA transcripts may abolish or create new miRNA-binding 
sites. One study reported the tissue-specific A to I editing of miR-376 
cluster transcripts, and identified one editing site located in the mid- 
dle of the ‘seed’ region, which is crucial for miRNA-target binding. 
Subsequently, the predominantly expressed edited miR-376 isoforms 
were shown to silence a different set of target genes compared with 
unedited miR-376 (ref. 80). This phenomenon is probably not limited to 
miR-376. Ina second study, it was suggested that up to 16% ofall human 
primary miRNAs may be targeted by ADARs”. A to I RNA editing was 
initially thought to be restricted to the coding regions of a few genes”, 
but recent genome-wide screening coupled with bioinformatic analyses 
has revealed numerous editing sites that are frequently located in non- 
coding repetitive RNA sequences”. As 85% of pre-mRNAsare predicted 
to be subject to A to I RNA editing, with most target sites located in 
introns and UTRs*””, this phenomenon may significantly increase the 
complexity of elucidating ceRNA interactions. 


Implications for human disease 

Many of the genes with known ceRNA interactors identified so far 
have been implicated in human disease. For example, PTEN is a potent 
tumour suppressor gene that is frequently disrupted in multiple can- 
cers and governs multiple cellular processes, including survival, pro- 
liferation, energy metabolism and cellular architecture®; HULC is the 
most upregulated gene in HCC and has been shown to regulate several 
genes involved in liver cancer’***; T cells that are infected with HSUR1- 
expressing H. saimiri cause aggressive leukaemias and lymphomas” 
and linc-MD1 expression in primary muscle cells isolated from patients 
with Duchenne muscular dystrophy has been shown to result in partial 
mitigation of the correct timing of the differentiation program™. This 
suggests that ceRNA crosstalk is not only of fundamental importance in 
physiological conditions, but is also crucially relevant in various patho- 
physiological states. 

In addition to the genomic amplifications, deletions, mutations and 
epigenetic modifications that dysregulate crucial gene function and 
result in disease pathogenesis, aberrant changes in ceRNA regulation 
may also contribute to disease initiation and progression. First, changes 
in the expression levels of ceRNA transcripts represent an additional 
mechanism for regulating the levels of key disease genes, such as tumour 
suppressors or oncogenes. ceRNA crosstalk may thus allow the func- 
tional characterization of transcripts with no other known function 
on the basis of their ceRNA activity. This is of particular importance 
because although many IncRNAs (for example, HULC) are expressed 
at low levels in steady state conditions, their expression is known to be 
dysregulated in various diseases®. Second, it has been shown that, com- 
pared with non-transformed cell lines, cancer cells in vitro often express 
mRNAs with shorter 3’ UTRs, which result from alternative cleavage 
and polyadenylation**. This phenomenon has also been reported in 
primary breast and lung cancer tumours”. Transcripts with shorter 3’ 
UTRsare not only subject to reduced miRNA regulation in cis, but will 
have diminished ability to act as ceRNAs through miRNA crosstalk in 
trans. Third, the altered expression of transcript variants of many genes 
through alternative splicing or differential use of initiation and/or stop 
codons has been linked to disease'!***”. These variants may produce 
proteins with different structures and functions, but may also introduce 
or eliminate MREs, and thus affect ceRNA activity. Fourth, chromo- 
somal translocations are frequent events in cancer that may contribute 
to tumorigenesis. These events result in the production of various fusion 
proteins, but could also potentially affect ceRNA regulation owing 
to misexpression of 3’ UTRs under the control of non-endogenous 
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promoters". Finally, SNPs (single-nucleotide polymorphisms) associ- 
ated with various diseases may create, modify or destroy MREs. For 
example, SNPs in genes related to Alzheimer’s disease may potentially 
dysregulate miRNA and ceRNA networks and thus contribute to aber- 
rant gene expression in Alzheimer’s disease”. These observations sug- 
gest that ceRNA network interactions may have a role in determining 
the effectiveness of RNA-directed therapies”. 


Future perspectives 

As ceRNA research is still in its infancy, there are only a handful of vali- 
dated examples of each existing class of ceRNAs, such as mRNAs, small 
non-coding RNAs, IncRNAs, pseudogenes and the recently described 
circRNAs (Tables 1 and 2). Although research should certainly focus 
on the identification of more ceRNAs in these categories to ascertain 
whether ceRNA crosstalk represents a widespread network of RNA reg- 
ulation, another exciting avenue for future work is the discovery of new 
classes of ceRNAs. For example, argonaute proteins have been found to 
interact with rRNAs and tRNAs, raising the possibility that these abun- 
dant, stable, small RNA species may also act as cCRNAs”. Intriguingly, 
3’ UTRs that are expressed independently of their associated protein- 
coding sequences have been described in humans, mice and flies, sug- 
gesting that they may represent another new class of ceRNAs”. 

Refinements in ceRNA prediction strategies will facilitate the dis- 
covery of additional ceRNA interactions. One limitation of current 
ceRNA prediction strategies is that they do not factor in miRNA and 
potential ceRNA expression levels. This is especially crucial as miRNAs 
and ceRNAs are known to have temporal, spatial and disease-specific 
expression patterns, and it has been suggested that ceRNA crosstalk 
applies mainly to a subset of ceRNAs and miRNAs whose cellular 
concentration and target abundance fall within a specific range of val- 
ues’*"+" A precise understanding of miRNA and ceRNA abundance 
is thus essential for deconvoluting any ceRNA network of interest in 
a particular context. Although paired miRNA and mRNA expression 
profiles have been integrated with bioinformatic target predictions to 
refine potential ceRNA interactions’”"*”*, they have been based mainly 
on relative quantitation methods such as microarray and quantitative 
PCR (polymerase chain reaction) analysis that are subject to varia- 
tions in probe binding and affinity. Techniques such as RNA sequenc- 
ing that allow the quantitation of absolute levels of both ceRNA and 
miRNA transcript abundance will allow a more precise determination 
of ceRNA effectiveness. In addition, the integration of methods such as 
HITS-CLIP and PAR-CLIP, which allow the biochemical identification 
of endogenous miRNA-target interactions on a transcriptome-wide 
level in specific cellular contexts, will further refine ceRNA predic- 
tion strategies that currently depend mainly on in silico bioinformatic 
miRNA predictions. 

Another limitation of current ceRNA prediction methods is their 
focus on crosstalk between canonical MREs that are located exclu- 
sively on the 3’ UTRs of target transcripts. A number of recent stud- 
ies have suggested that miRNA regulation is not limited to canonical 
3’ UTR targets. For example, several groups have demonstrated that a 
significant proportion of MREs in humans are non-canonical”*””, and 
others have shown that effective miRNA targeting can occur outside 
the 3’ UTR®*”. Furthermore, it has been shown that miRNA binding 
does not always result in target downregulation”; 5’ UTR structure has 
been demonstrated to be a crucial determinant of miRNA-dependent 
regulation”; and, for dynamically recruited miRNAs, changes in overall 
miRNA expression were found to not always be correlated with miRNA 
recruitment to the RISC”. These new insights into miRNA function 
should be incorporated into future ceRNA prediction analysis, and may 
potentially add further complexities to the dynamics of ceRNA regula- 
tion and significantly expand the realm of potential ceRNA activity. 

Although most of the ceRNA interactions identified so far have been 
between binary partners, there is increasing evidence that ceRNAs 
crosstalk in large interconnected networks; and that, in addition 
to direct interactions through shared miRNAs, secondary indirect 
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interactions may also have a profound effect on ceRNA regulation”. 


For example, PTEN expression has been shown to be regulated by 
ceRNA interactions between PTEN and its ceRNAs PTENP1, CNOT6L, 
VAPA, VCAN and ZEB2 (Fig. 2). In addition, indirect ceRNA crosstalk 
(for example, between VCAN and FN1), as well as secondary interac- 
tions between PTEN ceRNAs (for example, between PTENP1, CNOT6L 
and VAPA mediated by the miR-17 and miR-19 families) may also con- 
tribute significantly to the effectiveness of ceRNA regulation. Future 
ceRNA research should extend beyond the identification of binary 
ceRNA interactions and include network analysis of potential inter- 
twined miRNA and ceRNA networks. 

The conservation of ceRNA crosstalk in multiple organisms, includ- 
ing plants, zebrafish, mice, humans and viruses, suggests that it may 
represent an important widespread layer of RNA regulation. As already 
discussed, this regulation is not limited to non-coding RNAs, it also 
encompasses protein-coding mRNAs. Recently, the genome of the car- 
nivorous bladderwort Utricularia gibba was found to contain a typi- 
cal number of protein-coding genes, but significantly less non-genic 
DNA”, providing one example whereby a large non-coding genome is 
not essential for complex life. This presents an interesting opportunity to 
study potential RNA regulatory networks that involve mainly mRNAs 
and a much smaller non-coding transcriptome that consists of non- 
coding RNAs encoded in genic DNA such as circRNAs and antisense 
non-coding RNAs. 

In summary, analysis of ceRNA interactions and crosstalk in inter- 
twined networks may represent a robust platform to methodically func- 
tionalize non-coding transcripts on the basis of competition for shared 
miRNAs. Furthermore, the analysis sheds light on the non-coding 
function of protein-coding transcripts, and may uncover regulatory 
networks that have been overlooked by conventional protein-coding 
studies. To fully understand and appreciate the impact of ceRNA 
crosstalk on physiological and pathophysiological conditions, it will 
be of crucial importance to integrate these miRNA-RNA competitive 
networks with other competitive regulatory networks such as protein- 
RNA crosstalk. Although the full extent of ceRNA networks remains 
to be determined, this miRNA-RNA competition is a new post-tran- 
scriptional layer of endogenous competitive gene regulation, which has 
exciting implications for multiple basic biological systems, pathophysi- 
ological conditions and the development of new therapeutic approaches 
for cancer and other diseases. = 
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PIWI proteins and PIWI-interacting 


RNAs in the soma 


Robert J. Ross'*, Molly M. Weiner’* & Haifan Lin! 


The discovery of millions of PIWI-interacting RNAs revealed a fascinating and unanticipated dimension of biology. The 
PIWI-piRNA pathway has been commonly perceived as germline-specific, even though the somatic function of PIWI pro- 
teins was documented when they were first discovered. Recent studies have begun to re-explore this pathway in somatic 
cells in diverse organisms, particularly lower eukaryotes. These studies have illustrated the multifaceted somatic functions 
of the pathway not only in transposon silencing but also in genome rearrangement and epigenetic programming, with 
biological roles in stem-cell function, whole-body regeneration, memory and possibly cancer. 


acting RNAs (piRNAs), have significantly expanded the RNA 

world. piRNAs are generally 24-32 nucleotides in length and 
bind specifically to the PIWI subfamily of Argonaute proteins. Piwi was 
originally discovered in Drosophila’, in which it functions in germline 
stem-cell maintenance and self-renewal’. For clarity, we use PIWIs to 
refer collectively to PIWI proteins, whereas Piwi refers to the individual 
protein. Although piRNAs were discovered and formally defined in 
mammalian systems in 2006 as small non-coding RNAs that specifically 
interact with PIWIs**, cloning of piRNAs in Drosophila’ revealed that 
they include a previously discovered class of small non-coding RNAs 
called repeat-associated RNAs (rasiRNAs)'*"'. Since 2006, the PIWI- 
piRNA field has rapidly advanced, with a focus on the germ line, in 
which PIWIs and piRNAs are enriched and PIWI mutations lead to 
a profound infertility phenotype’*. Indeed, the name PIWI comes 
from the original mutant phenotype P-element-induced wimpy testis’. 
The best-known role of the PIWI-piRNA pathway in the germ line is in 
transposon silencing, because piRNAs map largely to transposable ele- 
ments’, with PIWI depletion leading to a drastic increase in transposon 
messenger RNA expression”. 

Despite the germline focus, since their discovery, PIWIs’ somatic 
function has long been documented. Initial work on the Drosophila 
gene piwi, the first identified member that defines the Argonaute gene 
family, determined that its germline function depends on the somatic 
cells of the gonad’. Recently, significant insight into the somatic function 
of the PIWI-piRNA pathway has come from the ovarian somatic cells 
of Drosophila. In addition, groundbreaking work in lower eukaryotes 
has demonstrated a conserved function for PIWIs and their associated 
piRNAs in somatic tissues — particularly in stem cells. In this Review, 
we focus on the role of the PIWI-piRNA pathway in the soma of diverse 
organisms, from basal eukaryotes to humans. We begin by looking at 
PIW1 expression and piRNA biogenesis in somatic tissues, and then illus- 
trate how the PIWI-piRNA pathway exerts diverse functions, including 
epigenetic regulation, transposon silencing, genome rearrangement and 
developmental regulation. Through this Review we hope to illustrate a 
broader role for the PIWI-piRNA pathway in the soma. 


T= discoveries of small non-coding RNAs, including PIW]-inter- 


Expression of PIWIs and piRNAs in somatic tissues 

PIWIs are expressed in organisms from sponges to humans. This 
expression occurs in remarkably diverse cell types, ranging from naive 
pluripotent stem cells to differentiated somatic cells, with most somatic 


expression related to various totipotent and pluripotent stem cells'°~ 


(Table 1). Several studies have demonstrated an essential stem-cell func- 
tion for PIWIs. One interesting example comes from planarian stem 
cells. piwi genes are expressed in planarian totipotent stem cells, called 
neoblasts, that can repopulate all somatic and germline lineages. Pla- 
narians have an unusual regenerative capability, in which neoblasts are 
responsible for maintaining and regenerating all tissues. PIWIs play a 
crucial part in the neoblast lineage, as illustrated by the two phenotypic 
consequences of knockdown of mRNAs encoding PIWIs: first, animals 
are incapable of body-part regeneration; and second, failure of tissue 
maintenance ultimately results in death”’. Furthermore, piRNA-like 
small RNAs, which depend on PIWIs for their biogenesis, have been 
identified in planaria. These RNAs are not germline-restricted, because 
ablation of the germ line by nanos mRNA knockdown does not affect 
piRNA-like RNA production’’”. In addition, RNA interference (RNAi) 
experiments in ascidians have demonstrated the requirement for PIWIs 
in whole-body regeneration (discussed later)”, 

Although substantial work has described PIWI expression beyond 
the germ line of lower eukaryotes, the somatic function of PIWIs was 
first characterized in somatic cells of the Drosophila ovary. The Dros- 
ophila Piwi protein is expressed in all somatic cells within the ovary 
and in early somatic cells of the testis”. Outside the gonad, Piwi binds 
to polytene chromosomes of the salivary gland”®. Emerging and unex- 
pected evidence shows that PIWIs function outside the gonad, par- 
ticularly in the Drosophila head. The first genetic evidence showed that 
piwi mutation leads to position effect variegation in the expression of 
the eye colour gene white in a clonal fashion*”*. This suggests the pos- 
sibility of cell-autonomous Piwi function in the eye. Direct molecu- 
lar evidence came from a more recent study in which two Drosophila 
PIWIs, Argonaute-3 and Aubergine, which were once thought to be 
germline-specific, showed region-specific expression in the brain, with 
their mutation leading to transposon upregulation in fly heads”. Inter- 
estingly, piRNA-like small RNAs mapping to transposons and hetero- 
chromatin are detectable in the Drosophila head on depletion of the 
RNAi effector Argonaute-2 (ref. 28). A recent publication has shown 
that all Drosophila PIWIs are expressed during early embryogenesis 
and that depletion of maternal Drosophila PIWIs results in profound 
chromosomal and mitotic defects, thus establishing a crucial function 
for PIWIs in the earliest stage of somatic development”. Together with 
the work on basal eukaryotes, studies in Drosophila highlight the exist- 
ence and function of PIWIs in somatic tissues. Although somatic PIWIs 
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Table 1 | Piwi orthologue expression 


Phylum Common name Species Known Piwi genes PIWI protein expression Reference 
Porifera Sponge Ephydatia fluviatilis EfPiwiA and EfPiwiB — Archeocytes (stem cells that differentiate into both somatic 16 
and germ cells) 
Cnidaria Jellyfish Clytia hemisphaerica Piwi Somatic stem cells of the tentacle bulb (produce stinging 17 
cells characteristic of the cnidarians) 
Jellyfish Podocoryne carnea Cniwi Somatic stem cells of the tentacle bulb (see above); striated 18 
muscle cells capable of transdifferentiation 
Ctenophora Comb jellyfish Pleurobrachia pileus PpiPiwil and PpiPiwi2 Actively dividing adult somatic cells; germ line 19 
Platyhelminthes Planaria Schmidtea mediterranea smedwi-1,smedwi-2 Neoblasts (totipotent stem cells that can repopulate all 21,22 
and smedwi-3 somatic and germline lineages) 
Saltwater flatworm Macrostomum lignano = macpiwi Neoblasts (see above) 20 
Mollusca Sea slug Aplysia californica Piwi Nervous system, heart and germ line 80 
Arthropoda Fruitfly Drosophila melanogaster piwi,aubandAGO3 — Gonad, brain, salivary gland 2,26, 92,93 
Chordata Sea squirt Botrylloides leachiiand = Piwi Stem cell population (capable of whole-body regeneration) 23,24 
(ascidian) Botryllus schlosseri 
Mouse Mus musculus Miwi, Miliand Miwi2 Diverse cancers (breast cancer, rhabdomyosarcoma, 12-14, 33 
medulloblastoma); male germ line 
Human Homo sapiens HIWI, HILI, HIWI2 and Diverse cancers (breast cancer, cervical cancer, endometrial 30, 32-36 


HIWI3 


clearly function in Drosophila, it remains a mystery whether piRNAs are 
generated outside the gonad, because some of the most studied piRNA 
biogenesis factors are not robustly expressed in non-gonadal somatic 
tissue (discussed later). 

In light of the above findings, an important question remains: are 
PIWIs present in mammalian somatic tissues? There are four human 
PIWIs: HIWI (also known as PIWIL1), HILI (also known as PIWIL2), 
HIWI2 (also known as PIWIL4) and HIWI3 (also known as PIWIL3). 
HIWLis expressed in haematopoietic stem cells but not in their differ- 
entiated progeny. Although this once led to the suggestion that HIWI 
might be involved in the stemness of these stem cells*, a powerful 
genetic experiment in which all three mouse Piwi genes were knocked 
out showed no detectable effect on haematopoiesis. Thus, PIWI expres- 
sion in haematopoietic stem cells may not have any functional impli- 
cation or PIWI function is redundant with other proteins, possibly 
Argonaute subfamily proteins’. With regards to human health, many 
studies demonstrate PIWI expression in a wide variety of human can- 
cers; however, these data are at best correlative and it is too early to tell 
whether PIWIs have any role in cancer (discussed later)? **. 

The expression of PIWIs in mammalian somatic tissues implies the 
potential existence of somatic piRNAs. One study provided evidence 
of piRNA expression in diverse somatic tissues of both the mouse and 
macaque’. However, the lack of appropriate negative controls com- 
bined with a small library size make it difficult to differentiate between 
true piRNA expression and contamination during library preparation. 
Indeed, this study might be a cautionary tale describing the challenges 
in the search for mammalian somatic piRNAs. Further work will help 
to determine whether piRNAs are expressed outside the germ line in 
mammals. Perhaps malignant tissues are good sites to begin the search 
given the expression of PIWIs in cancer. 


piRNA biogenesis in the Drosophila ovarian soma 

Our mechanistic understanding of somatic piRNA biogenesis comes 
from recent work in the somatic follicle cells of the Drosophila ovary. 
piRNAs in the ovarian soma are generated by a Piwi-dependent mecha- 
nism through the primary piRNA biogenesis pathway**”’. This pro- 
cess is independent of the other two Drosophila PIWIs, Aubergine and 
Argonaute-3, which function as a piRNA amplification loop in the ger- 
mline cells (known as secondary piRNA biogenesis). In the primary 
piRNA biogenesis pathway, long piRNA precursors are transcribed from 
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carcinoma, seminomas, hepatocellular carcinoma, gastric 
cancer, pancreatic adenocarcinoma, gastrointestinal 
stromal tumours, colon cancer, renal cell carcinoma); 
haematopoietic stem cells; and male germ line 


specific genomic loci known as piRNA clusters, cleaved and modified 
in the cytoplasm, and then transported into the nucleus in complex 
with Piwi (Fig. 1). Ovarian soma-specific piRNAs are transcribed from 
two main loci. The flamenco locus contains transposon remnants and 
encodes a long single-stranded piRNA precursor that is antisense to 
flamenco’s component transposons. Precursors generated from the 
flamenco locus target transposons of the gypsy family of long terminal 
repeat (LTR) transposons, including gypsy, ZAM and idefix”. Another 
ovarian soma-specific locus, traffic jam, has two functions: the 3’ UTR 
of its transcript is a substrate for somatic piRNA production, whereas 
its protein product drives Piwi expression in ovarian somatic cells”. 

Itis unclear how long single-stranded piRNA precursors are exported 
from the nucleus and how these precursors are initially processed into 
smaller fragments in the cytoplasm. One proposed, but unproven, candi- 
date for piRNA 5’ end determination is Zucchini, an outer mitochondrial 
membrane protein with single-strand-specific endonuclease activity 
in vitro”. In Drosophila, the maturing precursor then enters the peri- 
nuclear Yb body*’™, an ovarian soma-specific cytoplasmic structure 
named after the Yb protein“. Although it is not yet clear what occurs in 
the Yb body, several of its components are crucial for the processing of 
piRNA precursors into mature piRNAs, and for subsequent Piwi nuclear 
localization. These components include the helicase Armitage” and 
the TUDOR-domain-containing protein Vreteno“. Both Vreteno” and 
Yb*'* are needed for Piwi expression and/or stability. Zucchini also 
functions in piRNA maturation. Knockdown of the mRNA encoding 
Zucchini in a Drosophila ovarian somatic cell line leads to cytoplasmic 
accumulation of piRNA-intermediate-like molecules** and accumula- 
tion of Piwi in the Yb body; correspondingly, nuclear Piwi is absent. 
The mouse homologues of the above piRNA biogenesis factors, includ- 
ing Zucchini”, Armitage*”, Shutdown” and Vreteno” are crucial 
for piRNA biogenesis in the testes, and male mutants are infertile, thus 
implying conservation of these factors in piRNA biogenesis. 

Next, piRNAs are loaded onto Piwi in the cytoplasm through an 
uncharacterized step that is independent of Piwi slicer activity and 
nuclear localization signal’”*’. The co-chaperone Shutdown, which is 
essential to piRNA biogenesis in both the ovarian soma and germ line, 
may function in piRNA loading™. Ovarian somatic piRNAs contain 
a characteristic 5’ U, but the mechanism generating this 5’ U bias is 
unknown. It is possible that Piwi itself preferentially binds RNAs with 
5'U, as is the case for silkworm Piwi in vitro™. This might authenticate 
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Figure 1 | piRNA biogenesis in the Drosophila ovarian soma. piRNAs 

are generated from specific genomic loci known as piRNA clusters, which 
include flamenco, the 5’ UTRs of mRNAs and traffic jam in the soma. The 
long single-stranded piRNA precursor (red) is then exported from the 
nucleus. In the cytoplasm, the precursors are processed into mature piRNAs. 
The precursors are cleaved by an unknown endonuclease to generate their 

5’ end, and transported into the perinuclear Yb body for further processing. 
The precursors are loaded onto Piwi in a process that is dependent on Yb, 


and stabilize piRNA intermediates with 5’ U for further processing. 
An unknown protein then trims the 3’ ends of the maturing piRNAs. 
The shared structure of Argonaute family proteins suggests the pos- 
sibility that piRNA length may be determined by the number of bases 
protected within Piwi*. This also explains the characteristic size pro- 
file of piRNAs bound by each of the three Drosophila PIWIs’. Mature 
piRNAs are then 2’-O-methylated by a conserved methyltransferase, 
Pimet”, perhaps to ensure their stability. Finally, in an uncharacterized 
step, the Piwi-piRNA complex is imported into the nucleus to exert its 
regulatory function. 


Epigenetic regulation by the PIWI-piRNA pathway 

Strong evidence indicates that Piwi and piRNAs play a crucial part in 
epigenetic regulation. Pioneering work has shown that Piwi functions 
in the transcriptional silencing of Adh transgene arrays, and small RNAs 
have been proposed to have a role in Piwi-mediated silencing. Piwiisa 
predominantly nuclear protein” that localizes to salivary gland polytene 
chromosomes in an RNA-dependent manner”, and co-localizes with 
known epigenetic modifiers such as the Polycomb group proteins”. 
There is general agreement that Piwi is required for appropriate histone 
methylation and transcriptional gene silencing, as loss of Piwi in both 
germ line and soma leads to a decrease in the repressive methylation 
mark on histone 3 lysine 9 (H3K9)*®, an increase in RNA Pol II occu- 
and an increase in nascent transcript”. Indeed, tissue- 
specific knockdown of Piwi in somatic ovarian follicle cells implicates 
transcriptional repression as the dominant mode of transposable ele- 
ment control in the soma”. 

What is the mechanism by which Piwi directs epigenetic modifica- 
tion? The answer to this lies in piRNA, which probably guides Piwi to 
specific target sequences in the genome by sequence complementa- 
rity. Chromatin immunoprecipitation (ChIP) data from our labo- 
ratory suggest that the Piwi-piRNA complex binds its genomic target 
in euchromatin through a nascent transcript (often a long non-coding 
RNA), and in heterochromatin predominantly through a direct piRNA- 
DNA interaction’. According to these results, which await reproduc- 
tion by other labs, the Piwi-piRNA complex then recruits epigenetic 
factors such as HP 1a and the histone methyltransferase Su(var)3-9 to 


Armitage (Armi) and Vreteno. Overlapping proteins indicate protein- 
protein interaction. The co-chaperone Shutdown plays an uncharacterized, 
but crucial, part in piRNA loading. The putative endonuclease Zucchini 

(Zuc) is required for piRNA maturation and for nuclear localization of 

Piwi. Subsequently, piRNAs are trimmed to the appropriate length by an 
unidentified exonuclease and 2’-O-methylated at the piRNA 3’ end, rendering 
them more stable. The Piwi-piRNA complex is then transported into the 
nucleus, where it modulates chromatin state. 


exert their function (Fig. 2). HP1a is a highly conserved chromatin 
factor whose N-terminal chromodomain binds trimethylated H3K9 
whereas the C-terminal chromoshadow domain dimerizes with the 
same domain in another HP1la molecule and interacts with other pro- 
teins”. Piwi may directly recruit HP1a”®. HP1a may then recruit one 
of its well-characterized interactors, Su(var)3-9, which is responsible 
for most H3K9 methylation in Drosophila”. Alternatively, others have 
proposed that Piwi may first recruit a histone methyltransferase such 
as Su(var)3-9 or Setdb1. The methyltransferase could establish H3K9 
methylation and in turn recruit HP 1a; this possibility is supported by 
work in fission yeast”. In addition to these core epigenetic factors, 
Asterix (also known as DmGTSF1), an upstream nuclear factor, is 
required for Piwi-directed H3K9 methylation”. A recently identified 
downstream effector of Piwi, Maelstrom, is not required for the estab- 
lishment of H3K9 methylation but is required for the transcriptional 
silencing of transposable elements”. 

Several studies have revealed unanticipated epigenetic gene regula- 
tion carried out by Piwi. In Drosophila simulans, maternally deposited 
piRNAs against the retrotransposon tirant initiated H3K9-mediated 
transcriptional gene silencing of this element in the somatic tissues of 
developing embryos”. This study shows that maternal germline-derived 
piRNAs are required for somatic epigenetic programming. A recent 
study expanded on this idea by showing that both maternal and zygotic 
Piwi are required for establishment of heterochromatin in non-gonadal 
somatic cells of the early embryo”. Interestingly, Piwi-targeting of trans- 
posable elements for silencing means that those genes containing or in 
proximity to transposable element sequences may be piRNA pathway 
targets. The presence of a transposon or its remnants in an intron or in 
proximity to a gene correlates with significant transcriptional repres- 
sion”. Surprisingly, Piwi can also act as an epigenetic activator. In 
Drosophila, Piwi establishes euchromatic features of chromosome 3R 
telomere-associated sequence (3R-TAS)*, and whole-genome studies 
have shown that Piwi binding may enhance transcriptional activation 
marks in multiple regions™. In support of this unexpected epigenetic 
function, a recent independent study confirmed that piwi mutant flies 
have increased HP 1a enrichment in regions, including 3R-TAS, in which 
Piwi is implicated as an epigenetic activator”. How can Piwi-binding lead 
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Figure 2 | Piwi-piRNA mediated epigenetic regulation. Simplified 
illustrations of the currently proposed models of Piwi-mediated transcriptional 
gene silencing. a, In heterochromatin, Piwi may be guided to its target 
sequences by the complementarity of its bound piRNA (red) to genomic DNA. 
On binding, Piwi recruits the epigenetic modifier HP la that then recruits the 
major Drosophila histone methyltransferase Su(var)3-9, which then deposits 

a methyl group on the unmethylated histone 3 lysine 9 (H3K9). Through an 
unknown mechanism, a critical mass of the H3K9 repressive chromatin marks 
inhibits Pol II transcription, effectively silencing the Piwi-piRNA target. b, 

In euchromatin, Piwi targets nascent transcripts (green) by piRNA sequence 
complementarity. Subsequent to this, there are multiple working models. Piwi 
may directly recruit HP 1a, which in turn recruits a histone methyltransferase 
(Su(var)3-9 is shown) that then deposits methyl groups on the unmethylated 
H3K9. Alternatively, Piwi may directly recruit the histone methyltransferase, 
and subsequently HP 1a may then bind the methylated H3K9. Regardless of 
the mechanism, the net effect is that these repressive marks, in concert with 
Maelstrom (not shown), inhibit RNA Pol II transcription. 


to transcriptional silencing in most target sites, but activation in a small 
number of target sites? Perhaps Piwi interacts with different partners 
and/or is influenced by the local chromatin micro-environment or its 
sublocalization within the nucleus. Clearly, Piwihas many different roles 
in avariety of cellular processes, and one challenge in the field is to unite 
or to further distinguish between these many functions. 


PIWI-piRNA pathway in genome rearrangement 
Ciliates are single-celled organisms that possess remarkable nuclear 
dimorphism. They have two distinct genomes: a somatic macronucleus 
that functions in vegetative growth and is actively transcribed, and a 
transcriptionally silent germline micronucleus that functions in the 
exchange of genetic information during sexual reproduction, known 
as conjugation. Following conjugation and mitosis, the zygotic genome 
in one of the two micronuclei in each daughter cell is extensively edited 
to create a new and partial somatic genome that replaces the parental 
germline genome, in a process known as genome rearrangement or, 
more appropriately, somatic elimination. During somatic elimination, 
repetitive sequences are removed from the genome before polyploidiza- 
tion, thus preventing their transcription in the resultant macronuclei”. 
Although other organisms have complex mechanisms for transcriptional 
and post-transcriptional silencing of transposable elements, the ciliates 
simply delete such sequences from the somatic macronuclear genome. 
Early studies in the ciliate Tetrahymena revealed that their Piwi ortho- 
logues are essential for proper genome rearrangement and piRNA pro- 
duction. These piRNAs, termed scan RNAs in the original literature, 
are derived from the micronucleus during conjugation. Importantly, 
Tetrahymena piRNAs are distinct from those in other species as they are 
produced through a Dicer-dependent mechanism from double-stranded 
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RNA precursors”. In this sense, they are more like endogenous small 
interfering RNAs” (endo-siRNAs). After mating, the piRNAs provide 
sequence-specificity to target germline-restricted regions in the develop- 
ing somatic macronucleus for elimination (Fig. 3)’°; consequent histone 
methylation then groups these sequences for collective deletion”. In the 
current model, for which there is strong evidence in both Tetrahymena 
and Paramecium, long non-coding RNAs transcribed from the paren- 
tal somatic macronucleus act as sponges for germline-derived piRNAs. 
Thus, unbound piRNAs specifically mark any non-somatic regions in 
the genome for elimination in the developing daughter macronucleus”. 

By contrast, piRNAs in the ciliate Oxytricha complex with a Piwi 
orthologue, Otiwil, to mark somatic genes for retention during devel- 
opment of the somatic macronucleus. PIWIs are so crucial to Oxytricha 
that on knockdown of the mRNA encoding Otiwil the organisms do 
not survive after mating. Furthermore, injection of RNAs that target 
normally deleted genes leads to their retention through multiple sexual 
generations”. This suggests that genomic composition changes of the 
somatic macronucleus are heritable across multiple generations. In Tet- 
rahymena, the minority of the developing somatic genome is directed for 
deletion, whereas in Oxytricha the minority of the developing somatic 
genome is directed for retention. Although it is not known why these 
distantly related species solve this puzzle of genome rearrangement in 
opposite ways, it is clear that each has evolved the most efficient system 
with the fewest possible piRNA targets to direct the assembly of a com- 
plete somatic macronuclear genome”. 


PIWI-piRNA pathway in somatic development 

The PIWI-piRNA pathway has broad developmental functions in diverse 
organisms, from memory in the sea slug Aplysia to whole-body regen- 
eration in botryllids. In Aplysia, both synaptic plasticity and associative 
memory formation require the PIWI-piRNA pathway. Aplysia Piwi, in 
complex with piRNAs, responds to the neurotransmitter serotonin by 
directing CpG methylation of the CREB2 promoter. CREB2 is a major 
inhibitor of memory in Aplysia, so Piwi-mediated transcriptional silenc- 
ing of CREB2 results in memory through long-lasting, cell-wide enhance- 
ment of synaptic transmission®. The capacity of the PIWI-piRNA 
pathway to epigenetically regulate genes is thus exerted in mature neurons 
to promote cellular memory, emphasizing PIWI function in differenti- 
ated somatic tissue. 

Colonial ascidians are chordates that are capable of whole-body regen- 
eration. Remarkably, the ascidian Botrylloides leachii can regenerate its 
entire body from any blood vessel fragment containing only a few cells. 
This phenomenon depends ona population of Piwi-positive cells present 
on the luminal side of the vascular epithelium. On RNAi knockdown of 
the mRNA encoding Piwi, organisms cannot undergo whole-body regen- 
eration”. In the closely related Botryllus schlosseri, Piwi is also crucial for 
whole-body regeneration. Piwi positive-cells, which contribute to both 
the germline and somatic lineages of future generations, reside within 
the endostyle niche. B. schlosseri undergoes weekly asexual growth, dur- 
ing which a population of Piwi-positive stem cells vacates the original 
endostyle niche and migrates to the niche within the growing daughter 
organism before the parent niche undergoes massive apoptosis. In this 
way, the stem-cell population efficiently preserves itself over the course 
of the colony’s lifetime through migration to the offspring™. 


Piwi suppresses phenotypic variation 

The PIWI-piRNA pathway has a direct role in buffering against pheno- 
typic variation, and Piwi depletion in Drosophila results in new somatic 
defects in a random fashion and at low frequency. Canalization, a term 
coined by Conrad Waddington in 1942 (ref. 81), describes develop- 
mental robustness: the ability of a system to generate a single phenotype 
regardless of genetic or environmental perturbations. The heat-shock 
protein Hsp90 is a crucial chaperone in the suppression of phenotypic 
variation, that is, in canalization. Depletion of Hsp90 generates diverse 
somatic phenotypic variants in species ranging from Arabidopsis” to 
Drosophila**. These variants are heritable even in the absence of heat 
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shock or other forms of stress, and even on repletion of the wild-type 
Hsp90 levels in progeny, indicating that the effect is not simply due to 
the chaperone function of Hsp90 during stress. Drosophila Piwi forms 
a complex with Hsp90 and the heat-shock organizing protein, Hop, 
in vivo to suppress phenotypic variation™. 

This buffering of variation is probably accomplished through both 
genetic and epigenetic mechanisms. Evidence for a genetic mechanism 
stems from a study in which Hsp90 depletion in Drosophila leads to 
defective transposable element silencing by the PIWI-piRNA path- 
way. The resultant transposon-mediated mutagenesis may generate 
new phenotypes*’. However, the transposon mutagenesis hypothesis 
cannot explain the full spectrum of canalization phenotypes. Although 
one copy of maternal piwi is sufficient for transposon silencing, it is 
insufficient to suppress phenotypic variation™. In addition, maternal 
epigenetic factors, such as the trithorax group of genes that maintain 
active chromatin, also buffer against phenotypic variation in Dros- 
ophila®’. These observations indicate the involvement of an epigenetic 
mechanism in suppressing phenotypic variation. With regards to the 
inheritance of such phenotypes, work in the Caenorhabditis elegans 
germ line has shown that piRNAs induce transgenerational epigenetic 
inheritance’. Regardless of the mechanism, the PIWI-piRNA path- 
way clearly functions to suppress expression of new phenotypes and 
to maintain developmental robustness in Drosophila. 


The uncertain meaning of PIWI in cancer 

Cancer stem cells may help to explain resistance to cancer treatment and 
relapse after treatment in certain forms of cancer. Thus, great interest 
exists in developing our understanding of the basic biology of cancer 
stem cells and in identifying factors that drive their stemness. A large 
number of studies document the ectopic expression of PIW]Is in cancers. 
This was first reported in seminoma, a testicular germ-cell tumour in 
which HIWI was drastically overexpressed”. Related to this overexpres- 
sion, HIWI mapped to a genomic region linked to seminomas and non- 
seminomas. Since then, PIWI expression has been shown in a variety 
of somatic cancers: HIWI is expressed in gastric cancer****, whereas 
HILI is expressed in breast cancer, colon cancer, gastrointestinal stro- 
mal tumours, renal cell carcinoma and endometrial carcinoma’’. Some 
early studies also suggested that PIWI expression could be used as a 
prognostic marker”. In both hepatocellular carcinoma™ and soft-tissue 
sarcoma’, tumour HIWI expression is associated with increased risk 
of tumour-related death. The discovery of PIWI expression in diverse 
forms of human somatic cancers opens up a promising area of research, 
especially given the well-established role of PIWIs in stem-cell mainte- 
nance and self-renewal. 

Consistent with this notion, HILI enrichment was reported in a 
cancer cell subpopulation expressing the stemness factors OCT4 and 
NANOG™. In addition, a large variety of embryonic and developmen- 
tal genes are expressed in cancers. However, this does not necessarily 
imply that they have a causative role in tumorigenesis. Such a conclu- 
sion will need to be based on functional studies. 

At present, such functional studies are scarce. HILI overexpression in 
a fibroblast cell line activates STAT3 and the antiapoptotic factor BCLX, 
suggesting that HILI might function as an oncogene”. Genetic studies 
in Drosophila revealed that piwi mutation attenuates tumour growth in 
a sensitized lethal (3) malignant brain tumour (I(3)mbt)-mutant back- 
ground. Other piRNA pathway genes were also upregulated in /(3)mbt 
tumours. These findings are perhaps the strongest data available at pre- 
sent in correlating ectopic PIWI expression with tumour growth”. 

These preliminary mechanistic data are promising, but research 
on PIWIs in cancer remains at an early stage and is primarily cor- 
relative. It is known that insertional mutagenesis by LINE] elements 
is common in human epithelial cancers”'. Therefore, it is possible 
that PIWIs are expressed in reaction to increased transposon activ- 
ity in cancer, and thus act to protect the genome. Perhaps an in vivo 
overexpression model would be a good starting point to determine 
whether ectopic or overexpression of a PIWI protein can actually 
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Figure 3 | Somatic genome rearrangement in ciliates. Unicellular ciliates 
posses two nuclei: the germline micronucleus and the somatic macronucleus. 
After mating, the developing macronucleus is extensively edited to remove 
germline-specific sequences. The new mature somatic nucleus contains 

only genes needed for somatic (vegetative) growth. The rest of the germline- 
specific genome (red boxes) undergoes DNA elimination, an extraordinary 
method to purge the somatic genome of repetitive sequences and transposons. 
This process, called somatic genome rearrangement, differs between 
Tetrahymena and Oxytricha. a, Tetrahymena piRNAs (red lines) are generated 
by the germ line, and target germline-specific sequences of the developing 
somatic macronucleus for elimination. b, Oxytricha piRNAs (blue, orange and 
purple lines) are generated by the parent somatic macronucleus, and direct the 
retention of somatic genes in the mature somatic macronucleus (blue, orange 
and purple boxes). 


cause cancer, or is merely a consequence of tumorigenesis. Without 
a doubt, more large-scale, systematic research is needed before we 
can conclude whether or not human PIWIs have any role in cancer. 


Outstanding questions 
PIWIs occupy the interface between stem-cell and small RNA biology. 
They serve diverse roles in diverse tissues, from totipotent stem cells to 
totally differentiated cell types, from the germ line to the soma. Tanta- 
lizing questions remain, including how broadly are PIWIs and piRNAs 
expressed in mammalian somatic tissues? And, what is their function 
there, if any? The mammalian soma awaits our rigorous interrogation. At 
a more fundamental level, what are the molecular mechanisms by which 
Piwi regulates stemness? Conversely, what exactly is Piwi doing in dif- 
ferentiated tissues? The lower eukaryotes, in which we are making rapid 
progress on this topic, provide an excellent arena for these investigations. 
The reason underlying ectopic expression of PIWIs in many human 
cancers is still a mystery. Do PIWIs have a role in cancer? Ifso, do PIWIs 
provoke dedifferentiation or lend a competitive advantage by enhancing 
stemness or even as a reactive mechanism to suppress transposition? 
Finally, are piRNAs ectopically expressed in cancers? If so, what is their 
function? Answers to these questions will not only shed light on the 
function of PIWIs and piRNAs but could also mark paths that are ripe 
for cancer research. m 
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CNVs conferring risk of autism or 
schizophrenia affect cognition in controls 
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In asmall fraction of patients with schizophrenia or autism, alleles of copy-number variants (CNVs) in their genomes are 
probably the strongest factors contributing to the pathogenesis of the disease. These CNVs may provide an entry point for 
investigations into the mechanisms of brain function and dysfunction alike. They are not fully penetrant and offer an 
opportunity to study their effects separate from that of manifest disease. Here we show in an Icelandic sample that a few 
of the CNVs clearly alter fecundity (measured as the number of children by age 45). Furthermore, we use various tests of 
cognitive function to demonstrate that control subjects carrying the CNVs perform at a level that is between that of 
schizophrenia patients and population controls. The CNVs do not all affect the same cognitive domains, hence the 
cognitive deficits that drive or accompany the pathogenesis vary from one CNV to another. Controls carrying the 
chromosome 15ql11.2 deletion between breakpoints 1 and 2 (15q11.2(BP1-BP2) deletion) have a history of dyslexia and 
dyscalculia, even after adjusting for IQ in the analysis, and the CNV only confers modest effects on other cognitive traits. 
The 15q11.2(BP1-BP2) deletion affects brain structure in a pattern consistent with both that observed during first- episode 


psychosis in schizophrenia and that of structural correlates in dyslexia. 


Little information is available on whether or how rare CNVs confer- 
ring high risk of schizophrenia and/or autism affect physiologic func- 
tion of otherwise normal brains. As none of these CNVs hitherto described 
are fully penetrant for the diseases, and both schizophrenia and autism 
affect cognition, we aimed to examine the possibility that the CNVs affect 
cognition in control carriers, those who do not suffer either disease or 
intellectual disability. We based our selection of CNVs on a literature 
search for CNVs associated with schizophrenia and/or autism (‘neuro- 
psychiatric CNVs’); this search produced 26 CNV alleles (Supplemen- 
tary Table 1)’°. These CNV alleles are rare, found in 0.002% to 0.2% 
frequency, and cumulatively in 1.16% of our sample of 101,655 geno- 
typed subjects, representing approximately one-third of the Icelandic 
population (Supplementary Tables 1 and 2). 

We used the subset of genotyped subjects born before 1968, without 
excluding patients, to examine the association of each neuropsychia- 
tric CNV with reproductive outcome (‘fecundity’), defined simply as 
the number of children each subject had by age 45. After correction for 
multiple comparisons, three neuropsychiatric CNVs were significantly 
associated with fecundity (Table 1). Subjects carrying the 16p11.2 dele- 
tion or the 22q11.21 duplication show reduced fecundity, with the effect 
in males significantly greater than in females (P = 0.0083 and P = 0.029 
for the difference in effect by sex for the 16p11.2 deletion and the 22q11.21 
duplication, respectively). In contrast, individuals carrying the 16p12.1 
deletion have more children than do controls (Table 1). Those with 
deletions at 15q11.2(BP1-BP2) showa nominally significant reduction 
in fecundity (Table 1). Consistent with previous reports’, schizophrenia 


patients show a large decrease in fecundity, with a more pronounced 
reduction in males (P = 9.5 X 10° *° for the difference in effect by sex) 
(Table 1). 

We recruited neuropsychiatric CNV control carriers, controls carrying 
other CNVs not known to be associated with schizophrenia or autism 
(other CNVs’), controls without large CNVs, and schizophrenia patients. 
All recruited subjects (Supplementary Table 2 and Supplementary Fig. 1) 
were administered a battery of neuropsychological tests (see Methods), 
the mini international neuropsychiatric interview (MINI)° and the 
general assessment of function scale (GAF)°. 


The neuropsychiatric CNVs as a class 


We found that the GAF score is 0.70 standard deviations (s.d.) lower 
in the group of neuropsychiatric CNV control carriers than in popu- 
lation controls (P = 2.2 X 10”). Based on MINIs, anxiety and sub- 
stance abuse prevalences in the neuropsychiatric CNV control group 
are similar to those of controls (P = 0.27 and 0.36, respectively), however, 
depression and suicidal ideation are more common (odds ratio = 2.86, 
P= 0.0017, and odds ratio = 2.20, P = 0.011, respectively). The other 
CNVs have GAF scores 0.18 s.d. lower than population controls 
(P = 0.0098), but do not differ significantly from controls in prevalence 
of phenotypes assessed by the MINI (P = 0.22, 0.90, 0.97 and 0.097, for 
depression, suicidal ideation, anxiety and substance abuse, respectively). 

Neurocognitive deficits, or heritable neurocognitive traits, are seen 
in those at risk of schizophrenia and in unaffected family members”*. 
They typically distinguish patients with schizophrenia from controls 
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Table 1 | Fecundity of neuropsychiatric CNV carriers and schizophrenia patients based on individuals born before 1968 


CNV Carriers (male/female) Non carriers (male/female) Effect (male/female) P value 
16p11.2 del 14/13 33910/42223 0.14/0.51 1.6 x 10772 
22q11.21 dup 25/33 33899/42203 0.60/0.90 0.00093 
16p12.1 del 36 76124 1.32 0.0011 
15q11.2 del 73/99 33851/42137 0.81/1.03 0.015 
q21.1 del 21 76139 0.76 0.062 
15q11.2-13.1 dup 9 76151 0.69 0.11 
6p13.1 dup 90 76070 0.91 0.14 
6p11.2 distal del 11 76149 0.76 0.2 
16p13.1 del 28 76132 0.86 0.22 
7q12 dup 28 76132 1.11 0.3 
2p16.3 (NRXN1) del 10 76150 0.83 0.34 
16p11.2 dup 26 76134 0.89 0.35 
22q11.21 del 7 76153 0.79 0.37 
17p12 del 24 76136 1.08 0.48 
13q31.3 (GPC6) dup 76 76084 0.96 0.55 
0q11.22-23 dup 13 76147 1.08 0.62 
10q11.22-23 del 12 76148 0.96 0.81 
q21.1 dup 33 76127 1.02 0.88 
5q13.1 dup 17 76143 1.02 0.9 
225.3 (MYT1L) dup 106 76054 0.99 0.91 
5q13.3 all del 18 76142 1.01 0.95 
Schizophrenia 306/197 33618/42039 0.21/0.54 9.5 x 10°06 
he effect is the factor by which fecundity is altered in CNV carriers or schizophrenia patients. Different effects on males and females are shown when there is a significant (P < 0.05) interaction between sex and 


either CNV or patient status. Counting both models fitted (with and without sex interaction), 42 tests were performed, thus the significance threshold for CNVs is P= 0.0012. 


a 30-4 
— Population controls 4 
— Other CNV controls 
2.57 —— Neuropsychiatric CNV controls 
— Schizophrenia 
2.05 i 
E 1.55 i ? t z 
g t oe 
smile oe: : 
¢ 
4 $ é 
0.5 ;ti : Leaieg J 
* 3 ry ry 
ooze Ee a BE oe ee ae 
-0.5-4 
CGT ee eh @af5 % 2 oC 
= = Oo fo) Ss -— a 
> a of £8 o Oo 
iy a SO BB” a= 
9 n 
5 g 
b 394 
— Population controls 
— Other CNV controls 
2.5 —— Neuropsychiatric CNV controls Cy 
— Schizophrenia 
2.05 
5 15- 
E 
‘ow 
eS 1.05 r 
£ 1. i ; 
: ¢ $ 
0.55 3 t t ? 
0.04 § ri ry z a é 
-0.5-4 
=u WwW ae ow ao fr wb 
275 Ree ee E 882 2 
: 5 2a Se fe 
g 2) 
5 g 


362 | NATURE | VOL 505 | 16 JANUARY 2014 
©2014 Macmillan Publishers Limited. All rights reserved 


with an effect size of approximately one s.d. (ref. 9). The cognitive 
deficits seen in first-degree relatives of schizophrenia patients indicate 
that they are partly independent of the clinical state’® and are likely to 
persist in psychotic patients, at least in a milder form, even if a com- 
plete remission would be achieved. All subjects were administered 
tests for cognitive profiling that measure functions previously shown 
to be impaired in schizophrenia patients, including attention, spatial 
working memory, logical memory, executive function, cognitive flex- 
ibility, language and processing speed (see Methods). On all tests, the 
schizophrenia patients performed worse than population controls 
(Fig. la and Supplementary Table 3a). The neuropsychiatric CNV control 
carriers performed at a level between that of schizophrenia patients and 
population controls, whereas the controls carrying other CNVs per- 
formed in line with population controls (Fig. 1a and Supplementary 
Table 3a). For all the tests, the association is much weaker when IQ is 
taken into account (Fig. 1b and Supplementary Table 3b). This is not 
surprising as the cognitive tests measure attributes that contribute to IQ. 

Scores on the adult reading history questionnaire (ARHQ)’”” and 
adult mathematical history questionnaire (AMHQ) questionnaires (see 
Methods), designed to detect a history of reading and mathematics 
learning difficulties indicative of dyslexia and dyscalculia, separate the 
control neuropsychiatric CNV carriers from population controls with 
an effect of 0.50 s.d. (P= 3.1 X 10 °) and 0.55 s.d. (P= 2.5 X 10 ’), 
respectively (Fig. la). ARHQ and AMHQ scores for the carriers of other 
CNVs are not significantly different from those of population controls 
(P = 0.98 and 0.17, respectively). 


Figure 1 | Association of CNV groups with cognitive traits, GAF, ARHQ 
and AMHQ scores. a, Average standardized scores for schizophrenia patients 
(n = 161), control carriers of neuropsychiatric CNVs (n = 167), control 
carriers of other CNVs (n = 465) and population controls (n = 475). 

b, Average standardized scores after adjustment for IQ. AMHQ, adult 
mathematical history questionnaire; ARHQ, adult reading history 
questionnaire; CF, category fluency; GAF, global assessment of functioning; 
LF, letter fluency; LM I and II, logical memory I and II; Pers. errors, 
perseverative errors; P IQ, performance IQ; RVIP, rapid visual information 
processing; Stroop, difference in time to complete trial 3 and time to complete 
trial 2; Stroop 1, Stroop trial 1; SWM, spatial working memory (between- 
search errors for 6 boxes); TMT, TMT trail B — TMT trail A; TMT A, TMT 
trail A; V IQ, verbal IQ; (see Methods for further information on tests). Error 
bars represent s.e.m. Impairment is in s.d. units, ARHQ and AMHQ scores for 
the patient group are not available. 


Individual neuropsychiatric CNVs 


To determine whether the CNVs differ in their effects on cognition, 
we examined the association of individual CNVs with neurocognitive 
traits, GAF score and history of learning difficulties. Few control car- 
riers could be recruited for some of the neuropsychiatric CNVs but 
between 5 and 47 control carriers could be evaluated for each of 11 
CNVs (Supplementary Table 2). Six of the CNVs are associated with 
verbal and/or performance IQ with large effects (0.73-3.51 s.d. units) 
in the carrier controls. These are the 16p11.2 deletion and the recip- 
rocal duplication, 17p12 deletion, 17q12 duplication, 16p12.1 deletion 
and 16p13.1 duplication (Table 2). The effect is also large for the 2p16.3 
deletion carriers for performance IQ, although the P value is >0.005 
(Supplementary Table 4a). Significant associations were also found 
between individual neuropsychiatric CNVs and GAF, spatial working 
memory (SWM), AMHQ, category fluency, letter fluency, persevera- 
tive errors and Stroop trial 1 (Table 2). 

The alleles of the 16p11.2 CNV confer mirrored effects on anthro- 
pometric traits’*. The deletion, conferring high risk of autism", shows 
the greatest impairments in the cognitive domains tested in the control 
carriers. The reciprocal duplication, conferring risk of schizophrenia’ 
and autism", confers somewhat different abnormalities on the control 
carriers (Supplementary Table 5). Although the deletion is strongly 
associated with impaired verbal IQ and deficits in verbal letter and 
category fluency tests, in keeping with what is seen in autism, the dupli- 
cation more selectively impairs the spatial working memory and execu- 
tive functions that seem to be more important in the pathophysiology 
of schizophrenia’®. 

Four neuropsychiatric CNVs, duplications at 13q31.3, 22q11.21 
and 1q21.1 and a deletion at 15q11.2(BP1-BP2) show more modest 
(around 0.5 s.d. or less) or no effects on verbal and performance IQ 
(Supplementary Table 4a). Twenty-one control subjects carrying the 
22q11.21 duplication were evaluated and trends were seen for impair- 
ments in all neurocognitive traits and the most significant impairment 
was observed in category fluency (0.97 s.d., P= 1.4 X 10° *) (Supplemen- 
tary Table 4a). Ten control subjects carrying the 1q21.1 duplication were 
evaluated and not even a nominally significant effect was detected on 
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neurocognitive traits, GAF or history of learning difficulties (Supplemen- 
tary Table 4a). Forty-seven control subjects carrying the 15q11.2(BP1-BP2) 
deletion were evaluated, and significant associations were observed with 
a lower GAF score (0.66 s.d., P= 9.9 X 10°), history of learning dif- 
ficulties as evaluated by the ARHQ (0.70 s.d., P= 1.9 X 10 *) and the 
AMHQ (0.78 s.d., P= 2.3 X 10°) (Supplementary Table 4a). Asso- 
ciation with a lower GAF score indicates impaired functioning, pos- 
sibly due to some psychological disturbance, although the number of 
carriers studied was too small to allow detection of association with the 
individual phenotypes as derived from the MINI. 

When conditioned on IQ the associations with specific cognitive 
traits, GAF and history of learning difficulties become less significant 
for the 11 CNVs (Table 2 and Supplementary Table 4a). However, the 
association of AMHQ score with the 15q11.2(BP1-BP2) deletion remains 
the most significant (0.70 s.d., P= 2.3 X 10°“). In Fig. 2a and Sup- 
plementary Table 6 the neuropsychiatric CNV carriers are divided into 
those carrying the 15q11.2(BP1-BP2) and those carrying other neu- 
ropsychiatric CNVs. A clear difference in the effect of conditioning the 
ARHQ and AMHQ scores on IQ is observed in these two groups: in 
the 15q11.2(BP1-BP2) deletion group, the association with ARHQ and 
AMHQ scores is only slightly weakened when conditioned on IQ, whereas 
in the group of remaining neuropsychiatric CNV carriers (without the 
15q11.2(BP1-BP2) deletion carriers) there is no longer any significant 
association with the history of learning difficulties after conditioning 
on IQ (Fig. 2b and Supplementary Table 6). 

The 15q11.2(BP1-BP2) deletion has previously been shown to confer 
modest risk of schizophrenia’, behavioural disturbances’’, developmental 
and language delay'’, and epilepsy’’. We show that the 15q11.2(BP1- 
BP2) deletion has only modest impact on results of the neuropsycholo- 
gical tests but is still strongly associated with a history of difficulties in 
learning mathematics and reading (Fig. 2). IQ is only marginally lower 
in the controls carrying the 15q11.2(BP1-BP2) deletion than in the 
population controls. Using a score of greater than 0.43 on the ARHQ” 
as a surrogate for dyslexia”, the 15q11.2(BP1-BP2) deletion is associated 
with dyslexia with an odds ratio of 3.18 (P = 0.0017). Of three previ- 
ously described ARHQ subscales, based on factor analysis'’, the ARHQ 


Table 2 | Controls carrying different neuropsychiatric CNVs perform worse than population controls on cognitive tests, GAF and history of 
learning difficulties 

CNV Cognitive trait Effect P-value Effect (adjusted for 1Q) P-value (adjusted for IQ) 
16p11.2 de VIQ 3.51 5.90 x 10-16 NA NA 
17p12.de VIQ 2.99 2.30 x 10°? NA NA 
16p11.2 de LF 2.00 2.00 x 10°” 0.61 0.14 
16p11.2 de P1Q 2.01 1.30 x 10°© NA NA 
16p11.2 de CF 1.83 2.00 x 10°° 0.58 0.16 
16p11.2 de Stroop 1 18 2.80 x 10°© 1.14 0.006 
16p12.1 de VIQ 2.05 8.30 x 10°° NA NA 
16p13.1 dup PQ 1.09 9.30 x 10°° NA NA 
16p11.2 de Pers. errors L777 2.00 x 10°° 0.48 0.25 
15q11.2 de AMHQ 0.78 2.30 x 10°° 0.70 0.00023 
16p11.2 dup SWM 1.72 3.20 x 10°® 151 0.00025 
17q12 dup GAF 1.63 5.10 x 107° 1.43 0.00037 
16p11.2 de GAF 1.55 5.80 x 10°° 0.58 0.10 
17q12 dup VIO 1.57 8.10 x 10-5 NA NA 
15q11.2 de GAF 0.66 9.90 x 10°° 0.57 0.0012 
16p11.2 de SWM 1.49 0.00011 0.45 0.27 
22q11.21 dup CF 0.97 0.00014 0.81 0.0016 
15q11.2 de ARHQ 0.7 0.00019 0.60 0.0018 
17p12 del GAF 1.67 0.00031 1.11 0.021 
16p11.2 de TMTA 1,37 0.0004 0.4 0.33 
17p12 del Stroop 1.61 0.00043 1.13 0.018 
16p11.2 dup PIQ 1.29 0.00062 NA NA 
16p11.2 dup TMT 1.27 0.00073 0.91 0.016 
17p12 del CF 1.48 0.0012 0.62 0.2 
16p12.1 del PIQ 1.41 0.0021 NA NA 
16p13.1 dup V1IQ 0.73 0.0022 NA NA 
17q12 dup LF 12 0.0026 0.78 0.051 
16p12.1 del LM | and LM Il 1.25 0.0027 0.77 0.092 
16p13.1 dup SWM 0.66 0.0049 0.54 0.026 


Abbreviations for the different tests are given in the supplementary text. The significance threshold for the 11 CNVs each compared for 15 tests and 13 IQ-adjusted tests is P = 0.00016. AMHQ, adult mathematical 
history questionnaire; ARHQ, adult reading history questionnaire; CF, category fluency; GAF, general assessment of function scale; LF, letter fluency; LM | and LM Il, logical memory | and II; NA, not applicable; Pers 
errors, perseverative errors; P |Q, performance IQ; Stroop, difference in time to complete trial 3 and time to complete trial 2; Stroop 1, Stroop trial 1; SWM, spatial working memory; TMT A, TMT trail A; V IQ, verbal IQ. 
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Figure 2 | Association of CNVs with cognitive traits, GAF, ARHQ and 
AMHQ scores. a, Average standardized scores for controls carrying 
neuropsychiatric CNVs excluding the 15q11.2(BP1-BP2) deletion carriers 
(blue, n = 120), controls carrying the15q11.2(BP1-BP2) deletion (cyan, n = 47) 
and population controls (black, n = 475). b, Average standardized scores 
conditioned on IQ. Error bars represent s.e.m. 


dyslexia-symptoms subscale shows the strongest association with the 
15q11.2(BP1-BP2) deletion (effect = 0.71 s.d., P= 1.4 X 10+, Sup- 
plementary Table 7). Based ona score of greater than 12 on the AMHQ 
(see Methods), the 15q11.2(BP1-BP2) deletion shows association with 
dyscalculia (odds ratio = 3.91, P = 0.00011). In 136 controls carrying 
the reciprocal 15q11.2(BP1-BP2) duplication the results are comparable 
to those of population controls on the neurocognitive tests (Supplemen- 
tary Table 4a), and their GAF (0.01 s.d., P = 0.95), ARHQ (—0.22 s.d., 
P= 0.057) and AMHQ (0.07 s.d., P = 0.52) scores are also in keeping 
with those of population controls. 


Structural MRI phenotypes 


In a search for neural intermediate phenotypes, we performed struc- 
tural magnetic resonance imaging (MRI) on 15 control carriers of the 
15q11.2(BP1-BP2) deletion, 55 carriers of the reciprocal duplication, 
and 201 population controls. Given the association with schizophrenia, 
we focused our attention on regions defined by a recent meta-analysis 
of first-episode psychosis” (Fig. 3a). The 15q11.2(BP1-BP2) deletion 
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carriers have a reduced volume of grey matter in the perigenual anterior 
cingulate cortex (pACC) (Fig. 3b) and the left insula (Fig. 3c). Further- 
more, deletion carriers showed reductions in white matter of the tem- 
poral lobe bilaterally and an increase in the volume of the corpus callosum 
(Fig. 3d). Importantly, for both grey and white matter, 15q11.2(BP1-BP2) 
duplication carriers always show reciprocal changes in exactly the same 
regions altered in deletion carriers, providing the first demonstration of 
allele-dose-dependent effects of CNVs on the structure of the human brain. 

The pACCisa key region for regulation of limbic activity” previously 
shown to be abnormal in schizophrenia”’. Furthermore, the frontoin- 
sular cortex is highly connected to the pACC, with which it forms the 
cortical aspects of the salience network”, a circuitry linked to schizo- 
phrenia risk**. Although reduction in the volume of the temporal lobe 
white matter is a well-established feature of schizophrenia and is pre- 
sent early in the illness”®, the finding of increased callosal volume was 
unexpected, as patients with schizophrenia have reduced volume in this 
region’’. Notably, carriers of the schizophrenia-associated 22q11.21 
deletion also show increased volume of the corpus callosum”*. 

The abnormalities found in the structural MRI studies also show 
overlap with published work on structural correlates of dyslexia and 
dyscalculia. In dyslexia, grey matter abnormalities in the supramar- 
ginal gyrus were prominent in a recent meta-analysis”. Grey matter 
reductions in a very similar location in pACC have also been seen in 
developmental dyscalculia*’. In both cognitive developmental disor- 
ders, other regions are abnormal that are not implicated in the present 
study (such as left perisylvian areas in dyslexia and the intraparietal sulcus 
in dyscalculia), suggesting that the neuropsychological impairment seen 
in 15q11.2(BP1-BP2) deletion carriers may be related to specific key 
nodes in the networks associated with these cognitive dysfunctions. 

It is of interest that the controls carrying the 15q11.2(BP1-BP2) 
duplication (n = 136) perform to a similar level as population controls 
on all tests of cognitive function used in this study (Supplementary 
Table 4a). Thus, although mirror effects on brain volume phenotypes 
are observed for the 15q11.2(BP1-BP2) CNV, we do not observe clear 
mirror effects on the ARHQ and AMHQ scores. 


Conclusion 
There were two main aims to this study. The first was to determine 
whether carriers of CNVs that predispose to schizophrenia and/or 
autism, who have not been diagnosed with a psychotic disorder or autism, 
have cognitive abnormalities that are akin to those encountered in schizo- 
phrenia. If this were the case these neuropsychiatric CNVs could be 
used as instruments in the study of cognitive abnormalities that char- 
acterize the disease. The results show that carriers of these CNVs show 
cognitive abilities in between those of normal controls and patients with 
schizophrenia. This raises the possibility that the difference between the 
patients and the control carriers may not be due to a lack of penetrance 
but instead to variation in expressivity of the CNVs. It also shows that 
the cognitive abnormalities are not necessarily consequences of the 
disease, and that the risk of the disease may, at least in part, be mediated 
through the cognitive abnormalities. These CNVs could be used to iden- 
tify individuals in whom schizophrenia-like cognitive abnormalities could 
be studied without the confounding effects of psychosis or medications. 
The second aim was to better define the cognitive abnormalities in 
population controls carrying CNVs associating with schizophrenia and 
autism; by evaluating controls carrying the neuropsychiatric CNVs we 
sought to learn more precisely which cognitive abnormalities put car- 
riers at risk of developing schizophrenia. We tested the carriers prim- 
arily for those aspects of cognition that have been shown to be abnormal 
in schizophrenia (Fig. 1), and in all of these aspects the control carriers 
were found to perform somewhere between the schizophrenia patients 
and the population controls. When controlled for IQ, the number of 
tests that separate the control neuropsychiatric CNV carriers from 
population controls decreases substantially, which is not surprising 
as these tests assess functions that are components of the IQ. Of the 
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Figure 3 | Dose-dependent alterations in brain structure in 15q11.2 
(BP1-BP2) CNV carriers. a, Regions of interest defined by a meta-analysis of 
first-episode psychosis in pACC and insula”’. b, Grey matter alterations in 
perigenual anterior cingulate cortex. c, Grey matter alterations in insula and 
supramarginal gyrus (SMG). d, White matter alterations in left temporal lobe 
and corpus callosum. All data are displayed on a coronal, horizontal and sagittal 
section (left to right) of a structural template MRI. Maps are displayed at a 


11 neuropsychiatric CNV alleles tested independently for association 
in 5 or more controls, 8 associated with a cognitive trait. 

Dosage-sensitive genes at CNV loci can give rise to mirrored phe- 
notypes for anthropometric loci including body mass index’ and head 
circumference’. In this study we provide the first evidence that dose- 
dependent effects of CNVs also affect human brain structure directly. Two 
brain regions, with clear evidence of both structural and functional altera- 
tions early in the course of schizophrenia, show dosage effects in controls 
carrying the 15q11.2(BP1-BP2) deletion and its reciprocal duplication. 

In this paper we demonstrate how cognitive abnormalities and changes 
in the structure of the brain observed in schizophrenia are also found in 
control carriers of CNVs that confer high risk of the disease. One of the 
missing pieces in our understanding of the pathogenesis of schizophre- 
nia has been the nature of the physiologic function that is first per- 
turbed in the disease or the perturbation of which leads to the disease. 
We show that carriers who have not been diagnosed with autism, intel- 
lectual disability, or schizophrenia show intermediate phenotypes in 
brain structure that are in good agreement with the observations in 
first-episode psychosis. We suggest that the work presented here lends 
support to the idea that the cognitive abnormalities are fundamental 
defects in schizophrenia as they are manifest in carriers of CNVs confer- 
ring risk of the disease who do not suffer from the disease. Furthermore, 
in addition to the information they may provide on disease, these CNVs 
provide us with an opportunity to search systematically for the bio- 
chemical foundations of the cognitive differences between the carrier 
and non-carrier controls. 


METHODS SUMMARY 


Control subjects carrying CNVs or not carrying CNVs were recruited from a 
large genotyped sample. Subjects aged 18 to 65 years were recruited for cognitive 
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threshold of P = 0.001 uncorrected for presentation purposes, but findings are 
significant at P< 0.05, family-wise error corrected for multiple comparisons. 
Right panels in b-d associate with the images to the left and show plots of 
contrast estimates extracted from the peak voxel in standard space (template for 
defining the standard space; Montreal Neuroimaging Institute). Bars represent 
means across subjects. Con, controls; Del, deletion; Dup, duplication. Error 
bars represent s.e.m. 


phenotyping. The psychologists and psychiatrists evaluating all subjects were 
blind to genotype. To examine fecundity, a nation-wide genealogy database 
was used to calculate fecundity of patients and controls carrying neuropsychiatric 
CNVs. The MINPF’ was used to screen the controls for psychiatric disorders, and 
participants’ overall level of functioning and their ability to carry out activities of 
daily living were rated using the GAF scale*!. Memory was assessed using the 
logical memory subtest from Wechsler Memory Scale III (WMS-III)*. Verbal 
fluency was assessed using the controlled oral word association test (COWAT)”? 
and the category naming test**. The Stroop test was administered as an indicator 
of the ability to suppress an habitual response’’. The trail-making test (TMT) was 
administered as a measure of psychomotor speed and mental flexibility**. As a 
further measure of mental flexibility, including the ability to alter cognitive sets, 
the Wisconsin card sorting test (WCST)” was administered and the ratio of 
perseverative errors to the number of trials administered used in our analysis. 
Spatial working memory and sustained attention were evaluated by the compu- 
terized CANTAB battery, using the SWM* and rapid visual information pro- 
cessing (RVIP) subtests*, respectively. Intelligence was evaluated using the 
Wechsler Abbreviated Scale of Intelligence (WASI-I)”°. For neuroimaging, MRI 
examinations were conducted on a 1.5 T whole body Philips Achieva scanner. 
High-resolution T1-weighted images were processed according to the unified 
segmentation model with SPM8 and Matlab 8b software. Copy-number effects 
were examined on a voxel-by-voxel basis with a multiple regression model using 
SPMB8. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Control subjects carrying or not carrying CNVs were recruited from a large 
genotyped sample. Subjects aged between 18 and 65 years were recruited for 
cognitive phenotyping. The psychologists evaluating all subjects were blind to 
genotype. A nation-wide genealogy database was used to calculate the fecundity 
of patients and controls carrying neuropsychiatric CNVs. The mini international 
neuropsychiatric interview (MINI)° was used to screen the controls for psychi- 
atric disorders, and participants’ overall level of functioning and their ability to 
carry out activities of daily living were rated using the general assessment of 
function (GAF) Scale*'. Memory was assessed using the logical memory subtest 
from Wechsler Memory Scale III (WMS-III)°. Verbal fluency was assessed using 
the controlled oral word association test (COWAT)” and the category naming 
test**. The Stroop test was administered as an indicator of the ability to suppress 
an habitual response*’. The trail-making test (TMT) was administered as a mea- 
sure of psychomotor speed and mental flexibility**. As a further measure of mental 
flexibility, including the ability to alter cognitive sets, the Wisconsin card-sorting 
test (WCST)* was administered and the ratio of perseverative errors to the number 
of trials administered used in our analysis. Spatial working memory and sustained 
attention were evaluated by the computerized CANTAB battery, using the spatial 
working memory (SWM)”* and rapid visual information processing (RVP) subtests”, 
respectively. Intelligence was evaluated using the Wechsler Abbreviated Scale of 
Intelligence (WASI-I)*°. For neuroimaging, MRI examinations were conducted on 
a 1.5 T whole body Philips Achieva scanner. High-resolution T1-weighted images 
were processed according to the unified segmentation model with SPM8 and 
Matlab 8b software. Copy number effects were examined on a voxel-by-voxel basis 
with a multiple-regression model using SPM8. 

Sample. Twenty-six CNVs conferring risk of psychiatric disorders (‘neuropsy- 
chiatric CNVs’), of which most are recurrent, were identified through literature 
search (Supplementary Table 1). Control subjects carrying neuropsychiatric CNVs 
were identified from a large genotyped sample (n = 101,655). The sample had been 
genotyped by Illumina HumanHap (300, 370, 610, 1M, 2.5M) and Illumina Omni 
(670, 1M, 2.5M, Express) SNP arrays. BeadStudio (Illumina; version 2.0) was used 
to call genotypes, normalize signal intensity data and establish the log R ratio and B 
allele frequency at every SNP. Samples passing quality control were examined 
using PennCNV™. All putative neuropsychiatric CNVs and other CNVs not 
known to be associated with schizophrenia or autism (‘other CNVs’) were visually 
inspected using DosageMiner software (developed by deCODE genetics). The 
neuropsychiatric CNVs, with one exception, span more than 15 SNPs on the 
Illumina arrays (Supplementary Table 1). 

Both the neuropsychiatric CNVs and the other CNVs are large, on average 
around 1.5 and 1.0 Mb, respectively. All 26 neuropsychiatric CNVs delete or 
duplicate exons of genes, whereas 89 of the 94 other CNVs delete or duplicate 
exons of genes (Supplementary Table 8). 

In the sample of 101,655 genotyped subjects we identified 1,178 subjects car- 
rying one or more of the neuropsychiatric CNVs (1.16% of the sample). Carriers 
aged between 18 and 65 were recruited for further phenotyping and were excluded 
from control groups if any of the following applied: if they were diagnosed with 
schizophrenia, schizoaffective or bipolar disorder; if they were diagnosed with 
autism, intellectual disability or developmental delay at the State Diagnostic and 
Counseling Centre in Iceland serving children and adolescents with a disability; if 
they met psychoses criteria on the MINI interview; if they were diagnosed with 
schizophrenia, schizoaffective, bipolar disorder, autism, intellectual disability or 
developmental delay according to self reports (or reports from parents); if they 
were using antipsychotic drugs. 

Phenotyped control subjects passing the exclusion criteria were: 167 controls 

carrying neuropsychiatric CNVs; 465 controls carrying otherCNVs: 475 controls 
without large CNVs. In addition, 161 schizophrenia patients were recruited for 
the neuropsychological phenotyping. 
Phenotyping. Encrypted identifiers of subjects were decrypted by a represent- 
ative of the Icelandic Data Protection Authority and subjects were recruited to the 
study by a clinic overseen by the Icelandic Data Protection Authority. Psychologists 
and nurses phenotyping the participants were blind to genotype. Those working 
with the genetic data were blind to personal identifiers and could only work on the 
encrypted data set. Only a representative of the Data Protection Authority of 
Iceland holds the key for encrypting and decrypting the personal identifiers. 
Genotypes are only linked to encrypted identifiers. Approval for this study was 
obtained from the National Bioethics Committee of Iceland and the Icelandic Data 
Protection Authority. Written informed consent was obtained from all partici- 
pants or their guardians before blood samples or phenotypic data were obtained. 
All sample identifiers were encrypted in accordance with the regulations of the 
Icelandic Data Protection Authority. 
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GAF” score was used to rate participants overall level of functioning and their 
ability to carry out activities of daily living. The scale was rated by the tester with 
respect to psychosocial, social, and occupational functioning. All participants 
were also interviewed using the MINI? edition 5.0.0. The MINI was designed as 
a brief structured interview for the major Axis I psychiatric disorders in ICD-10 
and DSM-IV. 

Toassess cognitive function, logical memory I and II from the Wechsler Memory 
Scale (3rd edn) (WMS-III)** was used to assess memory. An Icelandic translation 
of the test was used (unstandardized). Two variables from the test were calculated; 
immediate memory that is the total item score immediately after the reading of 
story A and after both readings of story B, and delayed memory that is the total 
item score from both stories after 30 min delay. The average of the two scores was 
used in the analysis. 

Verbal fluency was assessed using the COWAT” and the category naming 
test*’; animal naming. In COWAT the subject is required to name as many words 
as he or she can that begin with a certain letter in one minute, the letters H and S 
were used in the Icelandic translation (unpublished). For the analysis a mean 
score of the number of words registered with each of the two letters were calcu- 
lated (verbal fluency) and for category fluency the number of animals registered 
(category fluency). 

The Stroop test®* is a measure of selective attention and the ability to block out 
irrelevant stimuli. An Icelandic translation (unpublished), derived from the 
Golden version”, was used in this study. In the first trial the participant is asked 
to read the names of colours written in black ink. In the second trial the particip- 
ant has to name the colour of words written in coloured ink, and in the last (the 
main) trial the participant has to name the colour of the ink of a word which is 
actually the name of another colour. Two measures from the Stroop test were used 
in the analysis; the time it took to finish trial 1, and the interference score which is 
the difference in time to complete trial 3 minus the time for completing trial 2. 

For visual scanning and mental flexibility, TMT A and B* were used adminis- 
tered. Trail A is a measure of psychomotor speed and attention and trail B is 
thought to bea test of flexibility of thinking. A measure of the time it took to finish 
trail A and a derived score of the time it took to finish trail B minus trail A was 
used in the analysis. 

The WCST was designed to assess abstract reasoning and the ability to shift 
cognitive strategies in response to environmental cues*’. A computerized version 
of the test was used**. The variable used in our analysis is per cent perseverative 
errors, which reflects the ratio of perseverative errors to the number of trials 
administered. Perseverative errors are made when participant persists in respond- 
ing to the old rule after the rule has changed. 

The SWM subtest from the CANTAB battery was used, which gives a measure 
of spatial working memory**. The measure used in this analysis was between- 
search errors for 6 boxes, which is a count of times the subject revisits a box where 
the token has previously been found. This is thought to rely on the long-term 
spatial memory system as the subject has to remember the location for some time 
and through interferences*. 

RVIP, a subtest from the CANTAB battery was used to access vigilance, which 
is the ability to sustain attention on one or more items over a period of time*’. The 
main variable is A’, which is a signal detection measure of sensitivity to errors 
regardless of error tendency. This is a measure of the subjects ability to detect 
target sequences by using p(hit) and p(false). p(hit) is the probability of a hit; the 
proportion of correct responses that are given when a target sequence is presented 
on the screen. p(false) is the probability of a false alarm; the proportion of res- 
ponses when there is no target sequence presented on the screen. 

Intelligence quotient (IQ) was assessed using the WASI-I. The WASI-I test 
includes four subtests: vocabulary and similarities, both tests of verbal IQ, and 
matrix reasoning and block design, both tests of performance 1Q”. 

The WASI-I test has been translated into Icelandic for standardization that is in 
progress and this study has been a part of that work. Here the healthy control 
group was used to make local norms by calculating the mean and s.d. for each of 
the age groups used in the US version of the WASI-I. Z-scores were then calcu- 
lated for every subtest for each participant, and the mean of the subtests was 
transformed into an IQ score having a mean of 100 and a s.d. of 15 in the healthy 
control group. 

A fraction of the participants were tested with an older translation of the 
WASI-I with two subtests, vocabulary and matrix reasoning. Thirty-nine subjects 
were tested using both editions of the test (with more than one year apart), and 
Pearson’s correlation coefficient between the two measures in these subjects was 
0.66. No significant effect of test type (new or old translation of WASI-I version) 
was found in any group included in the study. 

The adult mathematical history questionnaire (AMHQ) described here is 
modelled after the adult reading history questionnaire (ARHQ)’”. The AMHQ 
consists of six questions, each scored on a Likert-type scale ranging from 0 to 4. 
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The questions were: 1. Did you experience any difficulties in learning math in 
elementary school? 2. How was your math performance compared to your class- 
mates in elementary school? 3. How do you rate your math skills now compared 
to people your age with a comparable education level? 4. Did you experience any 
difficulties learning the multiplication table in elementary school? 5. How much 
extra help did you need when learning math in elementary school? 6. What is your 
current attitude towards math? 

The six questions were selected to assess the degree to which adults have experi- 
enced symptoms of specific disorder of arithmetical skills, or dyscalculia (F81.2), 
which according to the ICD- 10 criteria’, “Involves a specific impairment in arith- 
metical skills that is not solely explicable on the basis of general mental retardation 
or of inadequate schooling. The deficit concerns mastery of basic computational 
skills of addition, subtraction, multiplication, and division rather than of the more 
abstract mathematical skills involved in algebra, trigonometry, geometry, or calculus.” 

The score for the AMHQ scale ranges from 0 to 24 with higher scores indi- 
cating greater impairment. Internal consistency reliability (a = 0.90) was assessed 
using Cronbach’s « from AMHQ results of a large survey sample (n = 2,757). An 
exploratory factor analysis of this data set combining 28 items from ARHQ-Ice 
(22) and AMHQ (6) found that association with all three previously reported 
ARHQ subscale factors (dyslexic symptoms, current reading and memory)" were 
replicated, and all six AMHQ items had high factor loadings (= 0.55) ona separate 
fourth factor. This further confirms the internal consistency of the AMHQ scale 
and suggests an independence of the arithmetical disorder scale from the ARHQ 
total scale representing specific reading disorder or dyslexia and its three subscales". 
Concurrent validity was assessed by comparing AMHQ scores of adults (n = 39) 
who had by formal neuropsychological evaluation been diagnosed as children with 
specific disorder of arithmetical skills (F81.2) and population controls without 
diagnosis of psychiatric or learning disorders and no learning disorder by self- 
report (1 = 564). A significant difference in mean AMHQ scores was observed for 
these groups; that is, 17.8 (s.d. = 6.5) and 8.1 (s.d. = 5.3), respectively (P < 0.001). 

For statistical analysis of cognitive traits, scores from each cognitive test or 
questionnaire were inverse normally transformed. They were then adjusted for 
sex, age at testing and, where indicated, IQ based on data from controls only. Final 
scores were shifted and scaled so that controls had a mean of 0 and a standard 
deviation of 1, and also arranged so that higher scores indicated greater impair- 
ment. To take the information on relatedness of the individuals into account, 
CNV carriers or schizophrenia patients were compared with controls using gen- 
eralized least-squares regression with a variance-covariance matrix based on the 
kinship coefficient of each pair of individuals. Meiotic distance between neurop- 
sychiatric CNV control carriers evaluated for cognitive traits can be found in Sup- 
plementary Table 4b. 

The sample sizes obtained resulted in about 80% power to detect a difference of 
around 0.4 s.d. in the neuropsychiatric CNV control or schizophrenia versus popu- 
lation control comparisons, and about the same amount of power to detect a 
difference of around 0.3 s.d. for the other CNV control versus population control 
comparison. For the individual neuropsychiatric CNVs with the smallest sample 
size (n = 5), there was approximately 80% power to detect a difference of about 2.5 s.d. 
Fecundity. deCODE genetics has built a nation-wide genealogy database for its 
genetic studies. The database contains information on year of birth and numbers 
of children of Icelanders. An encrypted version of the genealogy database was 
used for studying the fecundity in patient and CNV groups. 


Mixed-effects Poisson generalized linear models (GLMs) were used to examine 

the association of fecundity with various neuropsychiatric CNVs and schizophre- 
nia. The number of children at age 45 or older was regressed on sex, year of birth 
(included as factors for each 5 year birth cohort), sex-year of birth interaction (for 
each birth cohort factor), sibship (to account for relatedness) and the CNV or 
disorder of interest. All predictors were modelled as fixed effects except for sibship, 
which was random. A second set of models including a sex-CNV/disorder inter- 
action term were also fit. 
Neuroimaging. MRI examinations were conducted on a 1.5 T whole body Philips 
Achieva scanner. Scans were performed with a sagittal 3D fast T1-weighted 
gradient echo sequence (TR 8.6 ms, TE 4.0 ms, flip angle 8 degrees, slice thickness 
1.2 mm, matrix 192 X 192, field of view 240 X 240 mm). Quality control of the 
MRI images consisted ofa test of image homogeneity covariance and noise estima- 
tion (VBM8 toolbox; Gaser, http://dbm.neuro.uni-jena.de/author/admin/) as well 
as visual inspection. 

For voxel-based morphometry, high-resolution T1-weighted images were pro- 
cessed according to the unified segmentation model** with SPM8 (statistical 
parametric mapping, Wellcome Department of Cognitive Neurology http://www. 
fil.ion.ucl.ac.uk/spm) and Matlab 8b software (The Mathworks). In brief, this 
method involves an iterated scheme of bias correction, segmentation into white 
matter, grey matter and cerebrospinal fluid and warping of prior images in ste- 
reotactic space to the data, which is repeated until no significant change occurs 
anymore. During normalization, images were interpolated to isotropic 1 X 1 X 1 
mm voxels. The VBM8-toolbox extends this model with a partial volume estima- 
tion to account for partial volume effects and the application of a spatially adaptive 
non-local means (SANLM) filter*’ for bias correction. Normalization to stereo- 
tactic space consisted of a linear affine registration and a linear deformation cor- 
responding to a high-dimensional DARTEL normalization’ implemented in 
VBM8. The resulting probability maps were modulated, that is, intensity-corrected 
for local volume changes during normalization, to make them more sensitive to the 
distribution of grey matter and white matter volume. Modulation was limited to 
nonlinear warping; global differences in brain volume were thus excluded in the 
modulated probability maps. The modulated maps were smoothed with a 12-mm 
FWHM kernel. 

For statistical analysis of neuroimaging data from MRI subjects carrying the 
15q11.2(BP1-BP2) CNV, copy number effects at 15q11.2 (duplication > control 
> deletion and deletion > control > duplication) on regional brain volume were 
examined on a voxel-by-voxel basis with a multiple regression model using SPM8; 
age and gender were included as covariates of no interest. 

An interaction between performance on neuropsychological tests that indi- 
cated a genetic dosage effect and copy number at 15q11.2, on regional grey matter 
volume was tested with a multiple regression analysis (SPM8); age and gender 
were included as covariates of no interest. 

Effects on grey matter volume were reported as significant when whole-brain 
voxel-level FWE-corrected P value was less than 0.05. Additional region-of-interest 
(ROI) analyses were performed in the following regions found to show both func- 
tional and structural abnormalities in a recent meta-analysis of subjects with high 
risk of schizophrenia”': anterior cingulate and medial frontal cortex, and bilateral 
insula extending into temporal and parietal cortex. Results of these ROI analysis 
were considered significant at P< 0.05 voxel level, FWE-corrected. 
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Biochemical reconstitution of topological 
DNA binding by the cohesin ring 


Yasuto Murayama’ & Frank Uhlmann! 


Cohesion between sister chromatids, mediated by the chromosomal cohesin complex, is a prerequisite for faithful chro- 
mosome segregation in mitosis. Cohesin also has vital roles in DNA repair and transcriptional regulation. The ring-shaped 
cohesin complex is thought to encircle sister DNA strands, but its molecular mechanism of action is poorly understood and 
the biochemical reconstitution of cohesin activity in vitro has remained an unattained goal. Here we reconstitute cohesin 
loading onto DNA using purified fission yeast cohesin and its loader complex, Mis4°°-Ss13°°“* (Schizosaccharomyces pombe 
gene names appear throughout with their more commonly known Saccharomyces cerevisiae counterparts added in 
superscript). Incubation of cohesin with DNA leads to spontaneous topological loading, but this remains inefficient. The 
loader contacts cohesin at multiple sites around the ring circumference, including the hitherto enigmatic Psc3°°@ sub- 
unit, and stimulates cohesin’s ATPase, resulting in efficient topological loading. The in vitro reconstitution of cohesin 
loading onto DNA provides mechanistic insight into the initial steps of the establishment of sister chromatid cohesion and 


other chromosomal processes mediated by cohesin. 


The cohesin complex is a central player in chromosome biology'®. 
Defects in cohesin and its regulators are responsible for chromosome 
missegregation in human cancers and are the cause of Cornelia de Lange 
syndrome, a severe developmental disorder®’. Despite notable advances*”®, 
our molecular understanding of cohesin function remains vague. The 
cohesin complex consists of a dimer of structural maintenance of chro- 
mosomes subunits, Psm15™! and Psm3°™*, long coiled coil proteins 
that interact at their hinge as well as their ABC-type ATPase head domains 
to form large proteinaceous rings'"'”. The head interaction is stabilized 
by the kleisin subunit Rad21°*". Several additional subunits associate 
with this ring assembly, including the essential Psc3°°° subunit, whose 
function remains poorly understood’*’*. Cohesin is thought to pro- 
mote sister chromatid cohesion by entrapping replicated sister chro- 
matids within the ring’s circumference’’, but how the Mis45°?_Ss135°4 
cohesin loader’, ATP hydrolysis by cohesin’’”° and cohesion estab- 
lishment during S phase’**!”* contribute to topological cohesin loading 
remains unknown. It also remains unknown whether the roles of cohesin 
outside of sister chromatid cohesion involve topological DNA binding. 


The cohesin loader binds to DNA 


We purified the fission yeast Mis4°“?-SsI3°“ cohesin loader com- 


plex after overexpression of its two subunits in fission yeast (Fig. la 
and Extended Data Fig. 1a). We used a similar strategy to purify the 
large Mis4°°* subunit by itself. On the basis of its hydrodynamic pro- 
perties, Mis45“?_Ss135°* is a moderately elongated, heterodimeric protein 
complex (Fig. 1b and Extended Data Fig. 1b). Because Mis4°“? contains a 
putative leucine zipper, we investigated DNA binding of Mis4*“?-Ssl3°°. 
We detected concentration-dependent DNA binding of Mis4°*“?-SsI3°°“* 
to double-stranded DNA (dsDNA) (Fig. 1c, d and Extended Data Fig. 1c). 
Single-stranded DNA (ssDNA) was bound poorly, whereas a Y-fork DNA 
substrate, mimicking open DNA structures that might exist at some 
physiological cohesin loading sites, showed no increased affinity com- 
pared to dsDNA. The dsDNA preference over ssDNA was confirmed 
in a competition assay (Extended Data Fig. 1d). DNA binding was stron- 
gest at low salt concentrations, but remained detectable under physio- 
logical conditions (Extended Data Fig. le). The Mis4°“* subunit alone 


Scc2 _ 


displayed DNA-binding properties indistinguishable from the Mis4 
Ss13°“* complex (Fig. 1d and Extended Data Fig. 1d). These results 
suggest that the cohesin loader makes direct contact with dsDNA, which 
the Mis4°°* subunit is largely responsible for. 
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Figure 1 | Mis4%“?-Ss13°“ is a DNA-binding protein. a, Purified 
Mis4°*?-Ss135°* complex (M43) and its Mis4°°* subunit (M4) were analysed 
by SDS-PAGE and Coomassie blue staining (CBB) or western blotting using 
antibodies directed against Mis4 or its carboxy-terminal haemagglutinin (HA) 
epitope. b, Glycerol gradient centrifugation of the Mis4°“?-Ssl3°°* complex, 
followed by SDS-PAGE and silver staining. Bovine serum albumin (BSA), 
catalase (Cata) and thyroglobulin (Tyr) were size markers. c, DNA 

binding of Mis4°“?-Ssl3°“, using an electrophoretic mobility shift assay. 

d, Quantification of DNA binding by Mis49°?-Ss13°°“ (left) and Mis4S¢? 
(right). Mean and standard deviation from three independent experiments 
are shown. 
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Figure 2 | In vitro reconstitution of cohesin loading onto DNA. a, Gel 
filtration of the cohesin complex. b, Purified cohesin, analysed by SDS-PAGE 
and Coomassie blue staining or western blotting as indicated. his, hexameric 
histidine; Pk, paramyxovirus SV5 Pk1 epitope. c, Schematic of the cohesin 
loading assay. d, Agarose gel electrophoresis and quantification of recovered 
DNA during the loading reaction. IP, immunoprecipitation. e, f, Time course 


Topological cohesin loading in vitro 


We purified the fission yeast cohesin complex after overexpression of 
its four essential subunits—Psm1, Psm3, Rad21 and Psc3—in budding 
yeast (Fig. 2a, b and Extended Data Fig. 2a). The complex showed ATP- 
independent dsDNA binding, as previously reported for cohesin*”’, 

that was independent of the substrate topology but was salt sensitive 
(Extended Data Fig. 2b, c). Topological DNA binding by cohesin is 
expected to be salt resistant’®’”"®. 

Taking these properties into account, we devised an assay to detect 
topological cohesin loading onto DNA (Fig. 2c). Cohesin and a relaxed 
circular DNA (rcDNA) substrate were mixed in the presence of ATP. 
After incubation, cohesin was immunoprecipitated. The cohesin beads 
were washed in high-salt buffer, and then DNA that remained bound 
was eluted and analysed by gel electrophoresis. 

In the absence of protein, or in the presence of Mis4°“?-Ss13°°* 
only, no DNA was recovered (Fig. 2d). About 5% of input DNA was 
recovered when we performed the reaction in the presence of cohesin. 
The amount of bound DNA increased fivefold when Mis4°°-Ss]3°°“* 
was also included. This suggests that the cohesin loader promotes salt- 
resistant cohesin loading onto DNA. The bulk of DNA binding occurred 
within 1 h, followed bya slower increase up to 4 h (Fig. 2e). DNA binding 
in the absence of Mis4°?-Ss13°°* similarly increased over time, albeit 
at a lower level, indicating that it might also be the consequence of an 
active loading process. The salt-resistant loading onto DNA required 
that the incubation itself was performed at low salt concentrations, 
excluding the possibility of a high-salt artefact (Extended Data Fig. 3a). 
Both Mis4*“? and the Mis4°“7-Ss13°™* complex caused indistinguish- 
able concentration-dependent stimulation of cohesin loading (Fig. 2f), 
suggesti ing that the ability to load cohesin onto DNA is contained within 
the Mis4°° subunit of the cohesin loader. 

Cohesin and its loader localize to specific chromosomal loci*”*. We 
modified our DNA substrate to contain fission yeast DNA sequences 
that are in vivo enriched for, or free from, Mis45°-Ssl3°°* and cohesin”>. 
No differences were observed when comparing cohesin loading using 
these substrates, indicating that DNA sequence does not affect cohesin 
loading, at least under our in vitro conditions (Fig. 2g). 


368 | NATURE | VOL 505 | 16 JANUARY 2014 


with or without Mis4°°?-Ss]3°°“ (e) and titration of Mis4°°?-Ss13°°* or 
Mis4°“” only (f). g, The loading reaction was carried out using rDNA 
containing fission yeast cohesin-bound (E1, E2) or -unbound (N) sequences. 
Mean and standard deviation from at least three independent experiments are 
shown in d, f and g. 


As expected, different closed circular DNAs served as efficient sub- 
strates in the loading reaction, but not linear DNA (Extended Data 
Fig. 3b). To confirm that the interaction was topological, we retrieved 
cohesin from a loading reaction, then linearized the bound circular 
DNA. This released cleaved DNA into the supernatant, whereas resi- 
dual uncleaved DNA remained cohesin-bound on the beads (Fig. 3a, b). 
Topological loading was also observed in a loading reaction contain- 
ing the Mis4°“ subunit only (Extended Data Fig. 3c). The lower level 
of DNA loaded onto cohesin in the absence of any loader was also released 
by linearization (Fig. 3b), suggesting that cohesin achieves topological 
loading onto DNA independently of a cohesin loader, albeit inefficiently. 
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Figure 3 | Topology-mediated DNA binding of cohesin. a, Schematic of the 
DNA-release experiment. b, Gel image of the experiment. Covalently closed 
circular (CCC) input DNA and linear form (L), supernatant (S) and bead (B) 
fractions of experiments in the presence or absence of Mis4°°?-SsI3°°", 

with or without PstI digestion, are shown. c, Diagram of the DNA-release 
experiment by TEV protease cleavage of Rad21. d, Agarose gel of recovered 
rcDNA and its quantification, the mean and standard deviation of three 
independent experiments, are shown. Rad21 cleavage was monitored by 
western blotting. 
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Cohesin dissociates from chromosomes to trigger anaphase follow- 
ing cleavage of its kleisin subunit by the protease separase’*”®. To test 
whether our in vitro-loaded cohesin recapitulates this, we replaced one 
of the two separase-recognition sequences in the kleisin subunit Rad21 
with a tobacco etch virus (TEV) protease-recognition motif (Fig. 3c). 
After loading this modified cohesin complex, we added TEV protease. 
This led to efficient Rad21 cleavage and concomitant loss of DNA 
from TEV-cleavable cohesin, but not from cohesin lacking the TEV- 
recognition site (Fig. 3d). Thus, cohesin achieves its final, cleavage- 
sensitive topological state on DNA from its initial loading. We conclude 
that topological loading onto DNA is an intrinsic activity of the cohe- 
sin complex, although it is greatly facilitated by the Mis4°“?-Ss13°°* 
cohesin loader complex, particularly the Mis4°? subunit. 


The loader engages cohesin’s ATPase 


To investigate the contribution of cohesin’s ATPase, we carried out load- 
ing reactions in the absence or presence of ATP, ADP or non-hydrolysable 
ATP analogues and found that ATP hydrolysis is required for cohesin 
loading onto DNA (Fig. 4a). Cohesin complexes harbouring ATPase- 
inactivating point mutations in the Walker A ATPase motif of Psm1 or 
Psm3 also failed to be loaded onto DNA by Mis4°*?-Ss13°°“ in vitro 
(Fig. 4b and Extended Data Fig. 2d). Therefore the ATP-hydrolysis- 
dependent DNA-binding step that has been inferred from mutant 
analysis in vivo'””° is most likely that of topological loading onto DNA. 

Fission yeast cohesin hydrolysed ATP at a rate of 3.7 min ', depen- 
dent on its intact ATPase (Fig. 4c, d). The addition of DNA or Mis4°°?— 
Ss13°*“* did not change this rate, but the addition of both led to a marked 
increase in ATP hydrolysis. This suggests that Mis4°-Ss]3°* stimu- 


lates cohesin loading once the loader, cohesin and DNA come together. 
The Mis4°“ subunit alone had a similar effect to the loader complex. 
Linear DNA wasas efficient as circular DNA in stimulating ATP hydrolysis 
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Figure 4 | Mis4°“?_§s13°“* stimulates cohesin’s ATPase. a, b, Loading 
reactions in the presence or absence of ATP or its derivatives (a), or comparing 
wild-type or Walker A (WA) motif mutant cohesin (b). c, Time-course analysis 
of ATP hydrolysis by cohesin with or without Mis4°°?-SsI3°“ or rcDNA. 

d, ATP-hydrolysis rates of wild-type and mutant cohesin derived from similar 
time-course analyses. rcDNA (C) or linear DNA (L) and a reaction with 
Mis4°* only was included. The mean and standard deviation of three 
independent experiments are shown. 
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(Fig. 4d), suggesting that DNA topology might not affect cohesin load- 
ing, but rather the retention of cohesin on DNA once loaded. 


Multiple interactions around the ring 


We next analysed the physical interaction between Mis4°?-Ss13°™* 
and cohesin. Consistent with previous observations’°”®, the two puri- 
fied complexes co-immunoprecipitated, suggesting that they directly 
interact. Mis4°“*-SsI3°°“* (or Mis4°“* by itself) also interacted with 
the purified Psm1-—Psm3 dimer, with the Psm3 subunit alone, as well 
as with the purified Psc3 subunit (Extended Data Fig. 4). To map the 
interacting regions, we used tiling peptide arrays covering each of the 
four cohesin subunits. They were probed with Mis4°“?-Ssl3°°* that 
was subsequently detected by immunoblotting. As an example, Fig. 5a 
shows the identification of a candidate Mis4°°?-Ssl3°“ interaction 
site on Psm1, close to its hinge. We identified putative loader inter- 
action sites on all four cohesin subunits, located at positions along the 
cohesin ring circumference (Fig. 5b and Extended Data Fig. 5). The 
contact regions within Psm1, Psm3 and Psc3 show evolutionary con- 
servation. In particular, two of the interacting regions in Psc3 delin- 
eate the ‘stromalin homology domain” that is characteristic of Psc3 
orthologues (Extended Data Fig. 6a-c). 

A second set of peptide arrays was used to identify amino acids cri- 
tical for these interactions. To verify the importance of the key residues 
identified (Extended Data Fig. 6d-f), we made corresponding point muta- 
tions in the psm1*, psm3* and psc3* genes and ectopically expressed 
them in fission yeast cells carrying temperature-sensitive mutations in 
the respective subunits (Extended Data Fig. 7). Point mutations in Psm3 
and each of the three interaction sites in Psc3 caused failure to com- 
plement temperature sensitivity. In the case of Psm1, we observed sen- 
sitivity to DNA-damaging agents, a known characteristic of reduced 
cohesin function?®. Thus, Mis4°°?-Ss13°° interacts with cohesin at 
multiple sites around its ring circumference, at places that are impor- 
tant for cohesin function. 


Ring contacts promote cohesin loading 


The interaction of Mis4°°-Ss]3°°“* with Psc3 was of particular inte- 


rest as no molecular role has yet been ascribed to this essential cohesin 
subunit’*”’’. The interaction between the two components was notice- 
ably reduced by point mutations in Psc3, particularly those within the 
stromalin homology domain (Fig. 5c). When expressed in fission yeast, 
all of the interaction site mutants failed to rescue the sister chromatid 
cohesion defect of a psc3-303 temperature-sensitive strain (Fig. 5d), 
confirming that interaction of Psc3 with the cohesin loader is impor- 
tant for sister chromatid cohesion. 

To define the role of Psc3 in cohesin loading, we carried out loading 
reactions with a cohesin trimer including Psm1, Psm3 and Rad21, and 
lacking Psc3. The cohesin trimer by itself did not detectably bind rcDNA, 
although addition of Mis4°°-Ssl3°“ resulted in a small degree of 
loading (Fig. 5e and Extended Data Fig. 8a). Adding back Psc3 to the 
cohesin trimer also gave a low level of DNA loading. However, Psc3 
together with Mis4°“?-Ss]3°°“ substantially restored cohesin loading. 
Likewise, DNA-dependent ATPase activity of the cohesin trimer was 
low, but was coordinately stimulated by addition of Psc3 and Mis4°°7- 
Ss13°°“ (Extended Data Fig. 8b). The rescue of cohesin loading by Psc3 
was reduced by point mutations in loader interaction sites, especially 
those in the stromalin homology domain (Fig. 5e). This effect was 
augmented at a higher incubation temperature of 37 °C, when small inter- 
action defects might be less well tolerated. Each of the loader interaction- 
deficient Psc3 proteins stimulated the loader-independent ATPase of 
the cohesin trimer to a similar extent as wild-type Psc3 (Extended Data 
Fig. 8c), suggesting that they retain functional interactions within the 
cohesin complex. Thus, a specific interaction of Psc3 with Mis4°°?- 
Ss13°*“* is required to promote cohesin loading. 

We also investigated whether interaction of Mis4°“-SsI3°““* with 
Psm1 and Psm3 contributes to cohesin loading. We introduced loader 
interaction site mutations into both Psm1 and Psm3. This did not affect 
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Figure 5 | Mis4$*?_SsI3°““ interactions around the cohesin ring. 

a, Identification of a Mis4°“-Ssl3°“ interaction site on a Psm1 tiling peptide 
array. Starting positions of the first peptide in every other row are indicated. 
b, Summary of Mis4°°?-SsI3°°“ interaction sites on the four cohesin subunits 
c, Co-immunoprecipitation of wild-type (WT) and interaction site mutant 
Psc3-Pk with Mis4°°?-Ss13°°“4. d, Rescue of sister chromatid cohesion after 
psc3-303 inactivation by wild-type or interaction site mutant Psc3. 100 cells 
were scored each in three independent experiments; means and standard 


cohesin complex stability or its loader-independent association with 
DNA (Fig. 5f and Extended Data Fig. 2d). However, loading was sti- 
mulated less than 1.5-fold by Mis4°“7-Ss13°°, whereas DNA loading 
of wild-type cohesin complexes is stimulated over fourfold by the loader. 
This suggests that contacts of the cohesin loader with Psm1 and Psm3 
also contribute to cohesin loading. 

In a complementary approach, we observed competition of the 
loading reaction by peptides corresponding to Mis4°“*-Ss]3°* inter- 
action sites on Psm1, Psm3 and Psc3. In vitro cohesin loading onto 
DNA was reduced by addition of each individual peptide, an effect 
that was augmented by combining the three peptides (Fig. 5g). Similar 
addition of peptides with a scrambled sequence, or containing point 
mutations that reduce the Mis4°“?-Ss13°°“4 interaction, did not elicit 
inhibition. Peptide addition did not affect Mis4*“?-Ssl3°°*-independent 
DNA loading of cohesin, indicating that the interaction site peptides 
target the cohesin loader. Together, our results suggest that multiple 
interactions of the cohesin loader along the circumference of the cohe- 
sin ring jointly promote cohesin loading onto DNA. 


Discussion 


We have reconstituted topological loading of cohesin onto DNA using 
purified fission yeast proteins. The biochemical characterization of this 
reaction has led to several surprises. Cohesin topologically binds to DNA 
independently ofa loading factor or of cohesion establishment reactions. 
We believe that our experiments using purified components provide 
final proof for the idea that cohesin entraps DNA”, which has been 
previously supported by evidence from less-well-defined and therefore 
less-conclusive systems’’. A second structural maintenance of chromo- 
somes (SMC) family ring complex, condensin, is thought to associate 
with budding yeast chromatin by topological embrace”. Condensin 
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deviations are shown. e, Loading reactions of a cohesin trimer were 
supplemented with wild-type or interaction site mutant Psc3. f, Wild-type 
and loader interaction site mutation (Psm1/3!™) cohesin complexes were 
compared. g, Wild-type (+), scrambled (sc) or point mutant (m) peptides 
corresponding to Mis4°“-Ssl3°““ interactions sites were included in loading 
reactions. Panels e-g present means and standard deviations from three 
independent experiments. h, Model for cohesin loading onto DNA. 


binding to chromatin may not, or only in part, depend on Mis4°*°?- 


Ss13°“* in fission and budding yeast*”*". In the case of prokaryotic SMC 
complexes, which show considerable similarity to their eukaryotic 
counterparts®”, a loader similar to the Mis4°°?-Ssl3°* complex is 
unknown. Assuming that bacterial SMC complexes also embrace DNA, 
our results explain how SMC complexes can topologically bind DNA 
without the need for a loader. The intrinsic ability to entrap DNA also 
makes it possible that cohesin acts in DNA repair and transcriptional 
regulation by topological embrace. 

Cohesin loading onto DNA in vitro was greatly facilitated by the 
cohesin loader Mis4°°-SsI3°““, an essential protein complex in all eukar- 
yotes studied. Unexpectedly, the Mis4°“* subunit alone harboured all 
activities required for the cohesin loading reaction. These findings 
contrast with the essential nature of $sl3°°“ (refs 18, 33). Budding yeast 
$s13°““ is required for protein stability of the Mis4°°? subunit in vivo"’. 
In addition, Ssl3°°“ could have a role in cohesin loading in vivo that 
was not recapitulated in our in vitro loading reaction. Loading occurred 
in a sequence-nonspecific fashion in vitro, whereas the cohesin loader 
occupies discrete locations on chromosomes in vivo~*”*. In Xenopus 
egg extract, the cohesin loader is recruited to chromatin via an inter- 
action with pre-replicative complexes that requires its Ss]3°°“* subunit**. 
Ss]3°°“* could mediate recruitment of the cohesin loader in the context 
of a chromatinized DNA substrate to ensure that cohesin loading occurs 
with spatial and temporal precision. 

Mis4°°?-Ss13°°* makes numerous contacts with the cohesin com- 
plex, including a prominent interaction with the previously enigmatic 
Psc3 subunit. These jointly facilitate cohesin loading onto DNA. By 
engaging into multiple contacts, Mis4°“?-Ssl3°““ could stabilize a tran- 
sient conformation of the cohesin complex during the DNA-loading 
reaction that is otherwise energetically unfavourable (Fig. 5h). Recent 
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studies of the SMC-related DNA repair protein Rad50 have suggested 
how an ATP-hydrolysis-dependent conformational change of the 
ATPase domains could be transmitted along their coiled coil***’”. A 
similar ATP-dependent conformational change has also been impli- 
cated in ring opening of a mismatch repair ABC-type ATPase’. Given 
the large dimensions of the cohesin ring, Mis4°°*-SsI3°°* may func- 
tion as a molecular ‘shaft’ to help transmit an ATP-dependent con- 
formational change from the head domains along the coiled coil to the 
hinge*’. The Psc3 subunit in this model would act as a molecular ‘hitch’ 
to connect the ATPase ‘engine’ to the Mis4°“—SsI3°" ‘shaft’. This 
scenario offers a molecular explanation for how ATP hydrolysis at the 
cohesin heads leads to ring opening at the opposite side of the cohesin 
ring. The ‘shaft’ at the same time controls the ATPase engine to ensure 
it is only started when all components of the loading reaction are in place. 


METHODS SUMMARY 

Protein purification. The fission yeast Mis4 and cohesin complexes 
were overexpressed in fission and budding yeast cells, respectively, and purified 
using protein A affinity, heparin and size-exclusion chromatography. Psc3 was 
expressed and purified from Escherichia coli. 

In vitro cohesin loading assay. Mis4*“?-SsI3°““ and cohesin were incubated with 
arcDNA substrate in the presence of ATP at 32 °C for 1 h. Cohesin was retrieved 
by immunoprecipitation and, after high-salt washes, cohesin-bound DNA was 
analysed by agarose gel electrophoresis. 

Other assays. Electrophoretic mobility shift assays were carried out using **P- 
labelled 50-nucleotide oligonucleotides as DNA substrates and were analysed by 
polyacrylamide gel electrophoresis. ATPase activity was determined by monitor- 
ing the hydrolysis of [y-*’P]-ATP by thin-layer chromatography. Protein inter- 
actions were analysed by co-immunoprecipitation of purified Mis4S°?-Ss]3°°* 
and cohesin, or by probing peptide tiling arrays of the cohesin subunits, synthesized 
on cellulose membranes, with Mis4°°?-Ss13°° followed by its immunodetection. 
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METHODS 


Expression and purification of Mis4°“?-SsI3°“*. The mis4* complementary 
DNA was amplified by PCR from a fission yeast S. pombe meiotic cDNA library 
(Yeast National BioResource Project, Osaka, Japan) and cloned under control of 
the nmt1* promoter into plasmid pREP1 (LEU2). An HA epitope anda protein A 
tag, separated by a PreScission protease recognition sequence, were added at the C 
terminus, yielding plasmid pMis4PA. The ssI3* cDNA was similarly cloned under 
control of the nmt1* promoter, without epitope tags, into pREP2 (ura4*) gene- 
rating pSsl3. Both pMis4PA and pSsl3 were together introduced into fission yeast 
(h— ura4-D18 leu1-32) and protein expression induced by growing the cells in 
Edinburgh minimal medium (EMM2) lacking thiamine at 30°C for 15h. Cell 
pellets were re-suspended in an equal volume of CLR buffer (50 mM Tris/HCl, 
pH 7.5, 1 mM dithiothreitol (DTT), 250 mM NaCl, 2.5mM MgCl, 5mM EGTA, 
20% (v/v) glycerol, 0.2mM sodium vanadate, 0.5mM phenylmethylsulphonyl 
fluoride (PMSF) and a protease inhibitor cocktail (Roche)), frozen in liquid nitro- 
gen and then broken in a freezer mill (SPEX CertiPrep 6850). The cell powder was 
thawed at 4°C, then twice the volume of CLR buffer was added. The lysate was 
clarified at 4 °C by centrifugation at 45,000g for 30 min, and then at 200,000g for 
1h. The clarified lysate was mixed with IgG sepharose (GE Healthcare, 1 ml resin 
slurry per 50 ml of lysate) and RNase A (10 1g ml’ final) at 4 °C for 3h. The resin 
was washed with 15 bed volumes of R buffer (20 mM Tris/HCl, pH 7.5, 0.5 mM 
tris(2-carboxyethyl)phosphine (TCEP), 10% (v/v) glycerol) containing 250 mM 
NaCl and 0.5 mM PMSF, and then with 15 bed volumes of the same buffer lacking 
PMSF. The resin was then suspended in two bed volumes of R buffer contain- 
ing 250 mM NaCl, 10 pg ml * RNase A and 5 U ml’ of PreScission protease (GE 
Healthcare) and incubated overnight at 4 °C. For elution, 1.5 volumes of R buffer 
were added to adjust the salt concentration to 100 mM NaCl and the eluate was 
loaded onto a HiTrap Heparin HP column (GE Healthcare). The column was 
developed with a linear gradient from 100 mM to 1 M NaCl in R buffer. The peak 
fractions were pooled and loaded onto a Superdex 200 10/300 GL gel filtration 
column (GE Healthcare) that was developed in R buffer containing 150 mM NaCl. 
The peak fractions containing Mis4-Ssl3 were concentrated to 500 ul by centri- 
fugal ultrafiltration (Amicon Ultra, Millipore). Typically, 0.5 mg of Mis4—Ssl3 was 
obtained from 30 g of fission yeast cells. 

The Mis4 subunit was overexpressed and purified from fission yeast (h— ura4- 
D18 leu1-32), bearing pMis4PA only, following essentially the same procedure. 
Typically, 0.3 mg of Mis4 was obtained from 10 g of cells. 

Expression and purification of cohesin. The Psm3-encoding cDNA, fused to 
3 X Pk epitope and 7 X histidine tag at the C terminus, was cloned under control 
of the bidirectional budding yeast S. cerevisiae GALI-GAL10 promoter in the 
GAL] direction into the shuttle vector YIplac211 (URA3)*°. The cDNA encoding 
Psm1 was cloned into the same plasmid in the direction of the GAL10 promoter, 
yielding the plasmid YIpPsm1-Psm3. The rad21* cDNA, fused to an HA epitope 
and a protein A tag at the C terminus, separated by a PreScission protease recog- 
nition sequence, was cloned under control of the GAL1 promoter into YIplac128 
(LEU2). The psc3* cDNA, fused to a7 X histidine tag at the amino terminus, was 
added in the direction of the GAL10 promoter yielding the plasmid YIpRad21- 
Psc3. The linearized YIpPsm1-Psm3 and YIpRad21-Psc3 plasmids were sequen- 
tially integrated into budding yeast (MATa, ade2-1, trp1-1, can1-100, leu2,3,112, 
his3-11,15, ura3-52, pep4A::HIS3MX) at the URA3 and LEU2 loci, respectively. 
The resultant Psm1/Psm3/Rad21/Psc3-expressing cells were grown in YP med- 
ium containing 2% raffinose to an OD¢oo = 2.0 at 30 °C. Galactose was added to 
the culture at a final concentration of 2% and the cells were grown for further 4.5 h 
at 30 °C. Cell pellets after centrifugation were re-suspended in an equal volume of 
CLH buffer (50 mM HEPES/KOH, pH 7.5, 1 mM DTT, 300 mM NaCl, 20% (v/v) 
glycerol, 0.5 mM PMSF and protease inhibitor cocktail), frozen in liquid nitrogen 
and broken in a freezer mill. Cell lysate preparation and purification on IgG seph- 
arose followed the procedure for purification of Mis4-Ssl3, but CLH buffer and H 
buffer (25 mM HEPES/KOH, pH7.5, 0.5 mM TCEP, 10% (v/v) glycerol) contain- 
ing 300 mM NaCl were used. Two volumes of R buffer were added to the eluate to 
bring the salt concentration to 100 mM before loading onto a HiTrap Heparin HP 
column. Bound proteins were eluted with steps of 300 mM, 600mM and 1M 
NaCl in R buffer. Cohesin was retrieved in the 600 mM NaCl fraction. This 
fraction was applied to a Superose 6 10/300 GL gel filtration column (GE 
Healthcare) that was developed in R buffer containing 200 mM NaCl. The peak 
fractions were concentrated to 400 ul by ultrafiltration. Typically, 0.5 mg of the 
cohesin complex was obtained from 30 g of budding yeast cells. Cohesin contain- 
ing a Rad21 TEV cleavage site, cohesin bearing K38] mutations in the Walker A 
ATPase motif of Psm1 (Psm1™“) or Psm3 (Psm3”“) and the cohesin trimer lack- 
ing Psc3 were purified using the same procedure. For expression of the cohesin 
Psm1/3 loader interaction mutant (Psm1/3™, containing Psm1(K467E, R468E) 
and Psm3(R828E, R829E)), an additional plasmid expressing the Psm3 mutant 
under control of the GAL1 promoter, based on YIplac204, was integrated at the 


TRP1 locus. This plasmid also contained the budding yeast GAL4 gene under 
control of the GAL10 promoter to improve galactose-induced protein expression. 
Cohesin Psm1/3'™ was purified as above. 

Expression and purification of the Psm1-Psm3 heterodimer and of Psm3. For 
purification of the Psm1—Psm3 heterodimer, the psm1* cDNA was cloned into 
pREP1, yielding pPsm1. The psm3* cDNA, fused to a3 X Pk and 7 X histidine 
tag at the C terminus, was cloned into pREP2 to yield pPsm3PkH. Both proteins 
were simultaneously overexpressed in fission yeast and cell lysate was prepared 
essentially as described for Mis4-Ss]3 purification, using CLH buffer. The clarified 
lysate was mixed with 1 ml nickel-nitrilotriacetic acid (Ni-NTA) agarose slurry 
(Qiagen) per 50 ml of lysate for 4 h at 4 °C. The beads were washed with 30 volumes 
of H buffer containing 300 mM NaCl and the bound proteins were eluted with H 
buffer containing 300 mM NaCl and 250 mM imidazole. The eluate was loaded 
onto a Superose 6 10/300 GL gel filtration column that was developed in R buffer 
containing 200 mM NaCl. The peak fractions were concentrated by ultrafiltration. 

Psm3 was expressed in fission yeast cells harbouring the pPsm3PkH plasmid 

only. Purification followed the same protocol as described for the Psm1-Psm3 
dimer, but Talon metal affinity resin (Clontech) was used for the histidine affinity 
purification. 
Expression and purification of Psc3. The cDNA coding for Psc3, fused to a 6 X 
histidine tag at the N terminus, followed by a PreScission protease recognition 
sequence, was cloned into the pET30a bacterial expression vector. The resulting 
pET30a-Psc3 was transformed into E. coli BL21 (DE3) Rosetta. Fresh transfor- 
mants were grown in Luria-Bertani medium containing 50 pg ml~! kanamycin 
and 34ygml~' chloramphenicol to OD¢oo = 0.5 at 37°C. Isopropyl B-D-1- 
thiogalactopyranoside (IPTG; 0.5 mM) was added and cells were further grown 
at 18°C for 18 h. Cells were collected and re-suspended in 10 pellet volumes of H 
buffer containing 300 mM KCl and protease inhibitors and disrupted by sonica- 
tion. The lysate was clarified by centrifugation at 45,000g for 30 min at 4°C and 
mixed with 1 ml Ni-NTA agarose slurry per 50 ml of the lysate for 4h at 4°C. The 
beads were washed with H buffer containing 300 mM KCI and bound proteins 
were eluted in H buffer containing 300mM KCl and 250 mM imidazole. The 
eluate was adjusted to 100 mM KCland loaded onto a HiTrap Heparin HP column 
that was developed with a linear gradient from 100 mM to 1 M KClin H buffer. The 
peak fractions were pooled and applied to a Superdex 200 10/300 GL gel filtration 
column in H buffer containing 500mM KCI. The peak fractions were dialysed 
against H buffer containing 200 mM KCland concentrated to 500 ul by ultrafiltra- 
tion. Typically, 0.1 mg of his-Psc3 was obtained from 2.51 of culture, which was 
used for most of the experiments. 

Alternatively, wild-type and mutant versions of Psc3 were expressed and puri- 
fied using pGEX-6P-1 bacterial expression vector (GE Healthcare). Expression and 
lysate preparation were carried out as described above, except that 100 pgml! 
ampicillin instead of kanamycin was used for cell culture. The lysate was incubated 
with glutathione sepharose 4B (GE Healthcare) at 4°C for 4h. Beads were then 
washed with H buffer containing 300mM KCI and bound proteins were eluted 
with 20 mM glutathione in the same buffer. The eluate was treated with 5 U ml * 
PreScission protease at 4 °C overnight. The eluate was adjusted to 100 mM KCl in 
H buffer and loaded onto a HiTrap Heparin HP column. The column was 
developed with a linear gradient from 100 mM to 1 M NaCl in H buffer. The peak 
fractions were loaded onto a Superdex 200 10/300 GL gel filtration column that was 
developed in H buffer containing 200 mM KCL. The peak fractions containing Psc3 
were concentrated to 500 pl. These preparations were used for the experiment 
shown in Fig. 5e and Extended Data Fig. 8b, c. 

For use in the protein interaction studies shown in Fig. 5c and Extended Data 
Fig. 4c, we fused the psc3* cDNA C-terminally to a 3 X Pk followed by a7 X 
histidine tag (Psc3-PkH) and cloned it into the expression vector pGEX-6P-1 to 
yield plasmid pGEX-Psc3PkH. Protein expression, cell lysate preparation and 
glutathione affinity purification were carried out as above. After PreScission pro- 
tease treatment, the eluate was incubated with Ni-NTA agarose at 4°C for 4h. 
Beads were washed again with H buffer containing 300 mM KCl and the bound 
proteins were eluted in H buffer containing 300 mM KCl and 250 mM imidazole. 
The eluate was adsorbed to glutathione sepharose 4B to remove residual glu- 
tathione S-transferase, and the flow-through fraction was then dialysed against H 
buffer containing 300 mM KCL. 

DNA, antibodies and peptides. The covalently closed circular (CCC) plasmid 
was pBluescript KSII (+). It was purified by equilibrium centrifugation in a CsCl- 
ethidium bromide gradient. rcDNA was prepared by treating the CCC form with 
E. coli topoisomerase I (New England BioLabs). Nicked circular DNA (NC) was 
prepared by DNase I treatment of the CCC preparation in the presence of ethi- 
dium bromide". Linear DNA (L) was obtained from the CCC plasmid by restric- 
tion digestion using HindIII. Cohesin-bound DNA sequences from the fission 
yeast genome (E1, E2), and an equally sized control region (N; ref. 25), were amp- 
lified by PCR and cloned into the EcoRV restriction site in pBluescript KSII (+). 
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The coordinates of these regions on chromosome II were as follows: E1, 256411- 
261337; E2, 323792-328714; N, 244641-249740. The dsDNA and Y-fork struc- 
ture DNA substrate for the electrophoretic mobility shift assays were prepared by 
annealing 50-nucleotide oligonucleotides, OL1+ OL2 or OL1+OL3, respectively, 
followed by purification through polyacrylamide gel electrophoresis. The DNA 
sequences are listed in Supplementary Table 1. The sequences of the peptides used 
in the in vitro loading competition assay are found in Supplementary Table 2. The 
antibodies used for immunoprecipitation and western blotting were anti-Pk (V5, 
AbD Serotec), anti-HA (12C5A, Roche), anti-hexameric histidine (Novagen) and 
anti-tubulin (Cell Signaling). Antibodies raised against fission yeast Psm1, Rad21 
and Mis4 were from BioAcademia. 

DNA electrophoretic mobility shift assay. All indicated concentrations of the 
components are the final concentration in the reaction mixture. The 50-nucleotide 
DNA substrates were *’P-labelled at their 5’ ends using polynucleotide kinase. 
10 nM of the DNA substrates and the indicated concentrations of Mis4-Ssl3 were 
incubated in 25 mM Tris/HCl, pH 7.5, 0.5 mM TCEP, 50 mM NaCl, 0.1 mg ml! 
BSA and 8% (v/v) glycerol at 37 °C for 15 min. The reactions were then resolved 
using 5% acrylamide gel electrophoresis in 1 X TAE buffer at 4 °C. Gels were dried 
on DEAE paper. The gel image was visualized and band intensities were quantified 
using a Phosphorimager and ImageQuant software (GE Healthcare). 

In vitro cohesin loading assay. The standard reaction volume was 15 pl and 
contained 100 nM Mis4-Ssl3 and 3.3 nM molecules of rcDNA (pBluescript KSII 
(+); see above) which were mixed on ice in CL1 buffer (35 mM Tris/HCl, pH 7.5, 
1 mMTCEP, 25mM NaCl, 25mM KCl, 1mM MgCh, 15% (v/v) glycerol and 
0.003% Tween 20). After 5 min, 150nM cohesin and 100nM Psc3 were added 
for further incubation on ice for 5 min. Free Psc3 was added to the reaction because 
Psc3 appeared somewhat sub-stoichiometric in our cohesin preparations and DNA 
binding by cohesin slightly increased by this addition (Fig. 2b and Extended Data 
Fig. 3d). The loading reaction was then initiated by addition of 0.5 mM ATP 
followed by incubation at 32 °C for 1h. To stop the loading reaction and to disso- 
ciate non-topologically DNA-bound cohesin, 500 pil of CP buffer (35 mM Tris/ 
HCl, pH7.5, 0.5mM TCEP, 500mM NaCl, 10mM EDTA, 5% (v/v) glycerol, 
0.35% Triton X-100) was added to the reaction mixture and incubated at 32 °C 
for 5 min, followed by 5 min on ice. Anti-Pk-antibody-coated, protein-A-conjugated 
magnetic beads (Dynal Inc.) were added and rocked at 4 °C for 15 h. The magnetic 
beads were washed three times with CW 1 buffer (35 mM Tris/HCl, pH 7.5, 0.5 mM 
TCEP, 750 mM NaCl, 10mM EDTA, 0.35% Triton X-100), and then once with 
CW? buffer (35 mM Tris/HCL, pH 7.5, 0.5 mM TCEP, 100 mM NaCl, 0.1% Triton 
X-100). The beads were then suspended in 15 1] elution buffer (10 mM Tris/HCl, 
pH 7.5, 1 mM EDTA, 50mM NaCl, 0.75% SDS, 1mg ml! protease K) and 
incubated at 50°C for 20 min. The recovered DNA was analysed by 1% agarose 
gel electrophoresis in 1 x TAE and the gel was stained with GelRed (Biotium). Gel 
images were captured using a GelDoc-It Imager (UVP) and band intensities quan- 
tified using ImageQuant. 

In experiments that included linearization of cohesin-bound cccDNA, the 
cohesin-bound DNA was retrieved by anti-Pk immunoprecipitation as described 
above, except that EDTA was omitted from the CP and CW buffers. The magnetic 
beads were further washed with RE buffer (35mM Tris/HCl, pH 7.5, 0.5mM 
TCEP, 100mM NaCl, 10mM MgCh, 0.1 mg ml! BSA, 0.1% Triton X-100). 
The beads were incubated with PstI (20 U, NEB) in 10 pl RE buffer at 10 °C for 
3h. The salt concentration was then adjusted to 500 mM NaCl in 15 pl and the 
reaction mixture was incubated on ice for 10 min, before the DNA molecules in 
the supernatant and bead fractions were analysed as described above. 

The cleavage of engineered Rad21 by TEV protease was carried out at 16 °C for 
2h. TEV protease (10 U, Invitrogen) was added to the reaction mixture following 
the cohesin loading reaction. The cohesin-bound DNA was analysed as above. 

The cohesin loading reactions using cohesin trimer and Psc3 mutants were 
carried out in the standard reaction condition with sight modification. Mis4-Ssl3 
(100 nM) was initially mixed with rcDNA, and then 150 nM cohesin trimer and 
150 nM Psc3 were added. The reaction mixture was incubated at the indicated 
temperature for 10 min, and then the reaction was initiated by addition of 0.5 mM 
ATP. After 1 h incubation at the indicated temperature, cohesin-bound DNA was 
analysed as described above. 

For the peptide competition experiments, the cohesin loading reaction was 
carried out in CL2 buffer (50 mM Tris/HCl, pH7.5, 1 mM TCEP, 25mM NaCl, 
1mM MgCh, 15% (v/v) glycerol, 0.003% Tween 20). The indicated peptides (Psm1 
peptides, 100 11M; Psm3 peptides, 100 11M; Psc3-1 peptides, 50 1M) were included 
with 50 nM Mis4-Ssl3 and 3.3 nM rcDNA for incubation on ice for 10 min, before 
100 nM cohesin and 0.5 mM ATP were added to the reaction mixture for incuba- 
tion at 32 °C for 45 min. Cohesin-bound DNA was then analysed as above. 
ATPase assay. Cohesin (150nM), 100nM Mis4-Ssl3 and 3.3nM rcDNA were 
mixed in CL1 buffer on ice. For the analyses of the cohesin trimer, 150 nM cohesin 
trimer, 100 nM Mis4—Ss]3, 150 nM Psc3 and 3.3 nM rcDNA were used. Note that 
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no detectable ATP hydrolysis was observed in the absence of cohesin or cohesin 
trimer (Fig. 4c and data not shown). Reactions were initiated by addition of 
0.25 mM ATP, spiked with [y-*?P]-ATP (PerkinElmer), and incubated at 32 °C. 
Reaction aliquots were retrieved at 0, 15, 30 and 60min and terminated by 
addition of three volumes of 500 mM EDTA. 1 pl of the terminated reactions 
were spotted on polyethylenimine cellulose F sheets (Merck), and separated by 
thin-layer chromatography using 400 mM LiCl in 1 M formic acid as the mobile 
phase. The separated spots representing ATP and released inorganic phosphate 
were quantified using a Phosphorimager and ImageQuant software. 

Protein interaction analyses. For co-immunoprecipitation, 50 nM Mis4—Ssl3 and 
50 nM cohesin or cohesin subunits (Psm1—Psm3 dimer, Psm3 or Psc3-PkH) were 
mixed in 50 ul of IP buffer (25 mM Tris/HCl, pH 7.5, 0.5 mM TCEP, 100 mM NaCl, 
2.5mM MgCl, 0.2% Triton X-100, 5% (v/v) glycerol) containing 0.5 mg ml"? 
BSA and incubated at 25 °C for 15 min. After placing on ice for 15 min, the bind- 
ing mixtures were transferred to anti-Pk-antibody-coated, protein-A-conjugated 
magnetic beads and rocked overnight at 4 °C. The beads were washed three times 
with IP buffer. The bound proteins were eluted in SDS-PAGE loading buffer, 
separated by 8.5% SDS-PAGE and detected by western blotting using anti-HA 
antibody to detect Mis4 and anti-Pk to detect Psm3 or Psc3. For far-western 
analysis, purified cohesin proteins were separated by SDS-PAGE, and transferred 
to a nitrocellulose membrane. The transferred proteins were re-natured in FW 
buffer (25mM HEPES/KOH, pH7.5, 150mM KCl, 15mg ml! BSA, 0.05% 
Tween 20) at room temperature (23 °C) for 2 h. The membranes were then blocked 
in FW buffer containing 5% milk powder. After rinsing with FW buffer, the 
membrane was incubated with 2.5 jig ml~' Mis4-SsI3 in FW buffer at 4°C for 
15h. Bound Mis4-Ss]3 was detected by probing with an anti-HA antibody. For the 
interaction studies using tiling peptide arrays, 20-amino-acid-long peptides cover- 
ing the amino acid sequences of Psm1, Psm3, Rad21 or Psc3, shifted by two amino 
acids, were synthesized on cellulose membranes using an Intavis Multipep peptide 
synthesizer (Intavis Bioanalytical Instruments AG). The membrane was activated 
in 50% methanol/10% acetic acid, and then blocked with 2.5% milk powder in TBS 
(25 mM Tris/HCl, pH 7.5, 150 mM NaCl, 0.1% Tween 20) at room temperature for 
2h. The blocked membrane was incubated with 1 jg ml” * Mis4—Ssl3 in TBS buffer 
at 4°C for 15h. The membrane was washed with TBS and bound Mis4-Ssl3 was 
detected using an anti-HA antibody. 

Fission yeast strain, media and genetic methods. Standard methods were used 
for fission yeast cell propagation and genetic manipulation, as described*’. To 
investigate the consequence of mutations within the putative Mis4-Ssl3 interac- 
tions sites in cohesin, the psml", psm3* and psc3* open reading frames were 
cloned behind their upstream promoter sequences (951 base pairs (bp), 977 bp 
and 990 bp, respectively) into the pARG1H vector®. At the C terminus, each 
protein was fused to a 3 X Pk followed by 7 X histidine tag to facilitate detection. 
The resultant plasmids formed the basis for the introduction of point mutations 
to alter Mis4-Ss]3 interaction sites using site-directed mutagenesis. Plasmids 
were linearized and integrated into the arg1~ locus of the psm1-897, psm3-602 
and psc3-303 temperature-sensitive strains, respectively, obtained from the Yeast 
National BioResource Project. Protein expression was monitored by western 
blotting of yeast whole-cell extracts obtained by glass bead breakage and TCA 
precipitation. For analysis of temperature sensitivity and drug resistance, expo- 
nentially growing cultures were spotted in fivefold serial dilutions on YES agar 
plates and grown at the indicated temperatures for 3 to 5 days. 

Sister chromatid cohesion assay. The GFP-marked cen2 locus was introduced 
into the psc3-303 strain, followed by the vectors carrying wild-type psc3* or the 
respective loader interaction site mutant alleles. Equal Psc3 expression levels were 
confirmed by western blotting. Exponentially growing cultures were shifted to 
37 °C for 3 h. Cells were fixed with 70% ethanol. Cells were photographed and the 
percentage of cells with split GFP signals were counted using an Axioplan 2 Imag- 
ing microscope (Zeiss) equipped with a 100X/1.45 numerical aperture objective 
and an Orca-ER camera (Hamamatsu). At least 100 cells were scored for each strain, 
the experiment was performed three times, and the mean and standard deviations 
are shown. 
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UvrD facilitates DNA repair by pulling 
RNA polymerase backwards 


Vitaly Epshtein'*, Venu Kamarthapu’*, Katelyn McGary', Vladimir Svetlov', Beatrix Ueberheide’, Sergey Proshkin?, 


Alexander Mironov** & Evgeny Nudler)? 


UvrD helicase is required for nucleotide excision repair, although its role in this process is not well defined. Here we show 
that Escherichia coli UvrD binds RNA polymerase during transcription elongation and, using its helicase/translocase 
activity, forces RNA polymerase to slide backward along DNA. By inducing backtracking, UvrD exposes DNA lesions 
shielded by blocked RNA polymerase, allowing nucleotide excision repair enzymes to gain access to sites of damage. Our 
results establish UvrD as a bona fide transcription elongation factor that contributes to genomic integrity by resolving 
conflicts between transcription and DNA repair complexes. Furthermore, we show that the elongation factor NusA 
cooperates with UvrD in coupling transcription to DNA repair by promoting backtracking and recruiting nucleotide 
excision repair enzymes to exposed lesions. Because backtracking is a shared feature of all cellular RNA polymerases, we 
propose that this mechanism enables RNA polymerases to function as global DNA damage scanners in bacteria and 


eukaryotes. 


Nucleotide excision repair (NER) is the most versatile and evolutio- 
narily conserved mechanism used by prokaryotic and eukaryotic cells 
to repair diverse types of DNA lesions’. In bacteria, the general NER 
pathway commences when UvrA and UvrB proteins bind damaged 
DNA and recruit UvrC to cleave the impaired strand on both sides of 
the lesion. The resulting oligonucleotide is displaced by UvrD and/or 
DNA polymerase I, which fills the gap using the complementary strand 
as a template”. 

NER rates are usually greatest at transcriptionally active genes. 
Moreover, the transcribed DNA strand is preferentially repaired compared 
to the non-transcribed strand®*. This phenomenon, known as transcrip- 
tion-coupled repair (TCR), is a sub-pathway of global NER**. The cur- 
rent model of bacterial TCR postulates that a DNA lesion blocking the 
progression of the transcription elongation complex is shielded from 
NER enzymes by the stalled RNA polymerase (RNAP). The DNA trans- 
locase, Mfd, binds to the stalled elongation complex through the B subunit 
of RNAP and dislodges the complex by ‘pushing’ it forward’. Concurrently, 
Mfd recruits UvrA to the exposed lesion site to expedite NER”. 

Here we propose an alternative TCR model whose key component 
is UvrD, a member of DNA helicase superfamily 1, which translocates 
in a3’ to 5’ direction using a single-strand, DNA-dependent, ATPase 
activity''"*. In contrast to Mfd, UvrD facilitates NER by pulling RNAP 
backward from the DNA lesion without causing termination. Our model 
further explains the role of elongation factor NusA, which is known to 
contribute to Mfd-independent TCR™. In this model RNAP recruits 
the NER complex via UvrD/NusA to the damage site. 


UvrD binds RNAP in vitro and in vivo 


We performed a mass spectrometry (MS)-assisted survey of proteins 
that interact with E. coli RNAP in vivo by treating E. coli K12 MG1655 
cultures with formaldehyde and isolating RNAP-containing material. 
Peptides in this material were identified by tandem liquid chromato- 
graphy mass spectrometry (LC-MS/MS) and used to calculate an expo- 
nentially modified protein abundance index (emPAI). This label-free 


method estimates the relative amount of proteins by the number of 
sequenced peptides per protein compared with the number of theoret- 
ically observable peptides’*. UvrD appeared in RNAP crosslinked 
complexes in abundance comparable to that of bona fide transcription 
elongation/termination factors NusA, NusG or Rho (Extended Data 
Fig. 1), indicating a potential direct interaction between UvrD and RNAP. 
To verify that UvrD, interacts with RNAP directly, we performed 
in vitro pull-down assays with purified UvrD and His6-tagged RNAP 
adsorbed to metal-chelating beads. UvrD bound to the beads only in 
the presence of immobilized RNAP and remained bound through 
multiple washings (Extended Data Fig. 1b). The UvrD-RNAP core 
complex was also isolated in a major discrete peak by size-exclusion 
chromatography (Extended Data Fig. 1c). Collectively, these data 
demonstrate stable and specific binding of UvrD to RNAP. 


UvrD promotes RNAP backtracking 


To investigate the role of UvrD in transcription we reconstituted a 
single-round runoff assay that measured ‘walking’ (nucleoside 5’- 
triphosphate (NTP) supply-controlled elongation) of the elongation 
complex along DNA”. We observed little effect of UvrD on RNAP 
promoter binding and open complex formation (not shown); however, 
UvrD profoundly influenced elongation, interrupting transcription at 
many positions along the template, so that only a fraction of elongation 
complexes produced a full-length (runoff) transcript (Fig. 1a, lane 2). 
Some UvrD-induced transcriptional ‘arrests’ coincided with pre-existing 
pause sites, whereas others formed de novo. Most did not change signi- 
ficantly with time, suggesting that the corresponding elongation com- 
plexes were either permanently arrested or terminated by UvrD. We 
excluded the latter possibility by showing that most of those transcripts 
remained bound to RNAP after extensive washing with high-salt buffer 
(Fig. 1a, lane 3). 

Most transcriptional pauses and arrests are caused by backtracking— 
a reverse sliding of RNAP along DNA and RNA”. In bacteria, back- 
tracked RNAP is prone to transcript cleavage stimulated by Gre factors, 
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Figure 1 | UvrD promotes RNAP backtracking. a, EC20 was combined 
with wild-type UvrD (lanes 2-5), catalytically inactive UvrD”?!2 (Janes 6, 7), 
or no UvrD (lane 1) before NTP chase (lanes 1, 2, 6) and wash (lanes 3, 7). Red 
lines connect corresponding RNAs from arrested elongation complexes before 
and after GreB cleavage (lanes 4, 5). b, EC11 was walked to positions 36, 38 
and 39. UvrD was added + ATP and chased. Inactivated EC36-39 (%) is 
indicated. c, The p1EC constructs” (left) and primer extension analyses (right). 
CAA modifications on the non-template strand of plEC + pVector (lanes 2, 3) 
or plEC + pUvrD (lanes 4, 5). The lac operator (Lac) and transcription bubble 
are indicated. Red lines show elongation complex position. 
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which reactivate elongation complexes by removing the extruding 3’ 
portion of the nascent RNA”. To test whether elongation complexes 
arrested by UvrD were backtracked, we first washed them to remove 
unincorporated NTPs and UvrD, and incubated with GreB (Fig. 1a, 
lane 4). GreB shortened most transcripts from UvrD-arrested elonga- 
tion complexes and reactivated these complexes in the presence of 
NTPs (lane 5), indicating that UvrD induces backtracking at many 
positions during elongation (Extended Data Fig. 2). 

To determine whether the enzymatic activity of UvrD was required 
for the arrest, we repeated the above experiment with UvrD’””'° the 
E221Q mutation in the UvrD active site results in ~600-fold decrease 
in helicase/translocase activity without significantly affecting binding 
of DNA, ATP” or RNAP (not shown). UvrD®””!® failed to cause RNAP 
arrest during elongation (Fig. la, lanes 6, 7). 

To confirm the requirement of ATP for UvrD-mediated transcrip- 
tional arrest, we walked RNAP to stalled elongation positions (EC) 
36-39 (counting from the + 1 start of transcription), washed the beads 
to remove unincorporated NTPs and incubated the stalled complexes 
with UvrD + ATP (Fig. 1b). After 1 min of incubation with UvrD + ATP 
most of EC36, EC38 and EC39 were inactivated: they failed to resume 
elongation upon addition of NTPs (lane 15); inactivation was almost 
complete after 2 min (lane 16). Without UvrD (lanes 2-6) or with UvrD 
lacking ATP (lanes 8-12), only ~30% of EC39 was inactivated after 
8 min of incubation, whereas EC36 and EC38 remained fully active. 
We conclude that UvrD induces RNAP backtracking at many positions 
through its ATP-dependent motor function (Extended Data Fig. 2). 

To monitor the effect of UvrD on RNAP backtracking in vivo, we 
used a plasmid (pl1EC) in which RNA synthesis initiated at a con- 
stitutive promoter is halted at a downstream position by the lacO- 
bound Lac repressor (Fig. 1c). The plasmid was designed so that the 
repressor blocked one isolated elongation complex’. Cells carrying 
pEC] were transformed with a UvrD overexpression plasmid (pUvrD) 
or an empty vector (pVector). To monitor the effect of UvrD and the 
position of the halted elongation complex we performed in situ foot- 
printing of its DNA bubble using the single-strand-specific probe, 
chloroacetaldehyde (CAA) (Fig. 1c). The halted elongation complex 
was clearly backtracked over a longer distance in the presence of pUvrD 
than empty vector: new CAA reactive sites were detected upstream and 
the reactivity of the downstream margin of the footprint was decreased 
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(lane 4). Thus, as observed in vitro, UvrD also causes RNAP to back- 
track in vivo. 


UvrD facilitates NER by towing RNAP 


RNAP stalled at DNA lesions presents a major obstacle for NER by 
obscuring damaged sites from repair enzymes’. The role of UvrD, 
therefore, could be to clear such lesions by forcing the obstructing 
elongation complex to backtrack. To test this hypothesis we reconsti- 
tuted the first steps of NER in vitro (Fig. 2a and Extended Data Fig. 3). 
An initial elongation complex was assembled on DNA carrying a single 
cyclobutane pyrimidine dimer (CPD) in the template strand at posi- 
tion +57 with respect to the transcriptional start site (Fig. 2a and 
Extended Data Fig. 3). CPD is the most common ultraviolet-induced 
lesion, a substrate for UvrABC and a roadblock for RNAP??"*4 (Extended 
Data Fig. 3). RNAP positioned 46 nucleotides upstream of CPD (EC11) 
did not affect UvrABC-directed CPD excision (Fig. 2a, lanes 3, 4, 7, 8). 
However, chasing EC11 to CPD inhibited the UvrC DNA cleavage 
reaction (lanes 11, 12). Addition of UvrD and ATP restored UvrC 
endonuclease activity at the CPD site (Fig. 2a, lanes 13-16), indicating 
that towing RNAP from the lesion site by UvrD is the first step of NER 
(Fig. 2a). 


Anti-backtracking factors interfere with NER 


In the RNAP ‘towing’ model of NER factors that normally inhibit 
backtracking may hinder the repair process: the GreA and GreB tran- 
script cleavage factors'® and active ribosomes control transcription 
elongation through this inhibition’’. To test this prediction we used a 
primer extension assay to monitor ultraviolet-induced lesion repair in vivo. 
Such lesions block DNA polymerase, generating truncated primer exten- 
sion products that can be analysed at single-nucleotide resolution”. 
We used a plasmid-borne lacZ gene isolated from wild-type and back- 
tracking-prone greA greB mutant cells (Fig. 2b). The primer extension 
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Figure 2 | RNAP backtracking facilitates NER. a, UvrD pulls RNAP from 
thymine dimers (TT). The T7A1 promoter template with TT (red) and 
schematic overview (top); EC11 was chased to TT (lanes 9-16) (bottom). 
Where indicated, UvrD was added for 5 min. Red boxes show UvrC-mediated 
DNA cleavage products (19 nucleotides); per cent cleavage is averaged 

(+ s.e.m.) from four independent experiments P < 0.05 . b, The primer 
extension experimental overview (left). As shown on the right, primer extension 
products reflect the location of lesions. Orange lines indicate repair zones for 
percentage repair. Asterisks indicate representative lesions repaired faster in 
greAB than in wild-type cells (blue), at the same rate (green) or ‘non-disappearing’ 
bands for normalization (orange). UV, ultraviolet; WT, wild type. 
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of lacZ DNA taken from both wild-type and mutant cells before and 
shortly after ultraviolet irradiation revealed truncated species unique to 
the irradiated DNA (Fig. 2b). However, the lesion repair rate (trun- 
cated species disappearance) was considerably higher in gre cells than 
in wild-type cells. Only a few lesions were repaired at the same rate in 
wild-type and Gre-deficient cells. Because this assay detects any DNA 
damage that halts Taq DNA polymerase (not only pyrimidine dimers) 
we conclude that: (1) most ultraviolet-induced adducts within the tran- 
scribed region block the progression of RNAP, hampering the repair pro- 
cess, and (2) RNAP backtracking facilitates repair at most damage sites. 

If these conclusions are correct, GreA and GreB deficiency should 
suppress UvrD sensitivity to ultraviolet and genotoxic agents against 
which NER provides protection. We examined three genotoxic che- 
micals: mitomycin C, 4-nitroquinoline-1-oxide (4NQO) and cisplatin. 
These chemotherapeutics generate crosslinks and bulky DNA adducts 
that are predominantly processed by NER”. uvrD cells were highly 
sensitive to killing by all three agents, whereas inactivation of greAB 
greatly suppressed uvrD sensitivity to all three agents (Fig. 3a and 
Extended Data Fig. 4) and to ultraviolet irradiation (Fig. 3b and 
Extended Data Fig. 4c). As the function of GreAB is to suppress 
RNAP backtracking, we conclude that RNAP backtracking is required 
for efficient NER in vivo and that UvrD is responsible for much of the 
NER-associated RNAP backtracking during genotoxic stress. 

In bacteria, active ribosomes control the rate of transcription elonga- 
tion at protein coding sequences by ‘pushing’ RNAP forward’, whereas 
inhibited ribosomes render cells highly resistant to ultraviolet-mediated 
lethality’’”*. To test if active ribosomes interfere with NER by decreasing 
the frequency of backtracking we used a sublethal dose of chlorampheni- 
col to slow ribosomes’ translocation”*. Analogous to the situation with 
Gre-deficiency, chloramphenicol rendered uvrD cells more resistant to 
4NQO, mitomycin (Fig. 3a and Extended Data Fig. 4b) and ultraviolet 
(Fig. 3b and Extended Data Fig. 4d). 

Collectively, these results argue that UvrD competes with anti- 
backtracking factors during genotoxic stress to promote NER (Fig. 3c). 
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Figure 3 | Anti-backtracking factors obstruct UvrD activity in NER. 

a, Inactivating greAB or slowing ribosomal translocation (with 1 pg ml 
chloramphenicol) suppresses uvrD sensitivity to mitomycin C (1 wg ml’; 
filled bars) and 4NQO (1 LM; striped bars). Data from three independent 
experiments are presented as the mean + s.e.m.; *P < 0.05, **P < 0.01. Cm, 
chloramphenicol; WT, wild type. b, Inactivating greAB, mfd or slowing 
ribosomal translocation suppresses uvrD sensitivity to ultraviolet irradiation (5 J 
per m’ at 30 °C). Data from three independent experiments are presented as the 
mean ~ s.e.m.; **P < 0.01. ¢, Cartoon summarizing interference with NER. 
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Interestingly, UvrD counteracts GreAB even under normal growth 
conditions, because uvrD inactivation suppresses the temperature 
sensitivity of gre cells (Extended Data Fig. 5). 


Mapping UvrD-elongation complex interactions 


We next mapped the location of UvrD within the elongation complex. 
Single- a ended DNA junctions are the preferred UvrD load- 
ing sites”; therefore, the upstream exposed portion of the transcription 
bubble” is an optimal UvrD binding site. In this model, UvrD bound to 
RNAP pulls the elongation complex backwards while unwinding the 
upstream fork of the bubble (Fig. 3c). To locate UvrD within the elonga- 
tion complex we incorporated photo-inducible 4-thio-deoxyuridine- 
5'-monophosphate (4-thio-U) into the non-template strand at different 
positions (relative to catalytic site) to induce protein-DNA crosslinks. 
The —1 and —9 4-thio-U probes generated the most abundant cross- 
linked species corresponding to BB’ subunits of RNAP; their pattern 
didn’t change in the presence of UvrD (Fig. 4a, lanes 5-8). A unique 
UvrD-specific crosslinked species was detected only with the —9 probe 
(lane 6) and was consistent with the UvrD adduct. No significant cross- 
links were detected in front of RNAP (position +22). These results 
support the model that UvrD binds near the upstream fork of the 
transcription bubble. 

To further map interactions between UvrD and the elongation com- 
plex, we used bis[sulfosuccinimidyl] suberate (BS3) to crosslink proximal 
lysine residues between UvrD and RNAP. Using mass spectrometry we 
identified three inter-protein crosslinks between UvrD and RNAP. 
These crosslinks mapped to BB’ subunits of RNAP and clustered around 
the DNA binding region on UvrD (Fig. 4b and Extended Data Fig. 6). 
In particular, one crosslink, (f K909)-(UvrD K124), mapped to the B 
flap tip domain. During elongation, the flap tip is required for activity 
of the NusA elongation factor*’. We propose that UvrD also binds 
RNAP proximal to the flap tip domain in a way that allows UvrD to reach 
the non-template DNA strand near the upstream fork of the bubble. 
The positions of the remaining two crosslinks (f’ K79)-(UvrD K389) 
and (B’ K40)-(UvrD K448) are consistent with this hypothesis (Fig. 4b 
and Extended Data Fig. 6). 

Considering the proximity of UvrD and NusA on the surface of 
RNAP, and the fact that NusA potentiates RNAP backtracking”, we 
examined whether NusA supports UvrD-mediated backtracking. Indeed, 
NusA augmented UvrD-inducible arrests during elongation (Fig. 5a), 
providing a mechanistic explanation for the genetic evidence implic- 
ating NusA in Mfd-independent TCR™. Consistently, we showed that 
deletion of greB suppressed sensitivity of nusA cells (AnusA and nusA11) 
to 4NQO, mitomycin C, nitrofurazone (NFZ) and ultraviolet (Fig. 5b 
and Extended Data Fig. 7). It has been reported that UvrD and NusA 
directly bind UvrB and UvrA, respectively’***™. Thus, not only do the 
two elongation factors act synergistically to clear RNAP from the lesion 
site, they probably also recruit UvrAB to the damage site to facilitate 
repair (Fig. 5c). 


RNAP as a global DNA damage surveillance vehicle 


A major challenge for NER is that UvrAB must recognize almost every 
type of bulky adduct on DNA among millions of normal base pairs in 
the genome. The aberrations detected by UvrAB range from apurinic/ 
apyrimidinic (AP) sites to ultraviolet-induced photoproducts to large 
chemical adducts’. The accuracy and efficiency of the repair of such 
variable lesion types would be improved by a screening mechanism 
that recognizes a common structure or process, such as that of arrested 
RNAP during transcription elongation. 

The preferential repair of the template DNA strand, that is, TCR, 
implies that the transcription apparatus augments the search and/or 
binding of lesions by NER enzymes’. Cellular RNAPs are highly sensi- 
tive to aberrations in the template strand, and are temporally or per- 
manently blocked by various DNA adducts”****. As bacterial and 
eukaryotic genomes are pervasively transcribed, RNAP can serve as 
a global surveyor of DNA damage that constantly monitors the quality 
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Figure 4 | Mapping UvrD interactions with the elongation complex. a, The 
5'-radiolabelled scaffold carries a single photo-inducible 4-thio-dU (red) in the 
template strand (top). Protein crosslinking adducts corresponding to the B and 
6’ subunits of RNAP and UvrD (red asterisk) (middle). A model of UvrD 

binding (bottom). b, UvrD (top, Protein Data Bank (PDB) accession number 


of DNA via its natural one-dimensional diffusion. However, when 
stopped at DNA lesions, RNAP must either be terminated or moved 
aside to maintain damage site accessibility for NER. The Mfd pathway 
of TCR necessitates transcription termination’®, whereas the UvrD 
pathway described here uses RNAP backtracking. Importantly, this 
new UvrD mechanism facilitates NER without loss of RNAP, enabling 
transcription to promptly resume after DNA repair. 


Non-template 
strand 


Transcription elongation 


21S4)'? is crosslinked to RNAP (middle, PDB accession number 41GC)” at 
three positions (magenta) that span the B (yellow) and f’ (light blue) subunits, 
proximal to the non-template strand (blue, PDB accession number 4G70). The 
B flap-tip-helix (green) is indicated and the suggested binding interface is 
circled. Schematic summarizing UvrD-RNAP binding shown below. 


Mfd-independent TCR 


The present work argues that the bulk of TCR occurs via UvrD- 
dependent backtracking (Fig. 5c); the sensitivity of uvrD and nusA cells 
to ultraviolet and various DNA damaging agents is much greater than 
that of mfd cells (Figs 3, 5b and Extended Data Figs 4, 7, 8). Yet, pro- 
moting backtracking by eliminating Gre factors, or by slowing ribo- 
somal translocation, eliminates much of the DNA damage-induced 
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Figure 5 | UvrD and NusA cooperate in backtracking-mediated NER. 


nitrofurazone (NFZ, 2 1M), and ultraviolet irradiation (15 J per m’). Data from 
three independent experiments are presented as the mean + s.e.m.; *P < 0.05, 
**P < 0.01. c, Model for backtracking-mediated NER. 


a, NusA facilitates UvrD-mediated backtracking. The fraction of full-length 
transcript (percentage runoff) is indicated. b, GreB inactivation in MDS43 cells 
suppresses AnusA sensitivity to mitomycin C (0.5 pg ml’), 4NQO (3 uM), 
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lethality associated with UvrD or NusA deficiency (Figs 3, 5b and 
Extended Data Figs 4, 7). Consistently, deletion of DksA, a factor that 
competes with Gre for the secondary channel of RNAP, also increases 
E. coli sensitivity to mitomycin C**. Remarkably, deletion of m/fd itself 
prominently suppresses uvrD sensitivity to ultraviolet and mitomycin 
C (Fig. 3b and Extended Data Figs 4, 8). This is consistent with the anti- 
backtracking mechanism of Mfd’, further supporting the notion that 
RNAP backtracking is a prerequisite for most NER events (Fig. 3d). 
Survival after genotoxic challenge depends on both efficient NER and 
swift recovery from the SOS response. Indeed, greAB deletions facil- 
itate NER, but do not increase cell survival (Fig. 3a, b). Similarly, Mfd 
function may not be as important for TCR per se, but instead for 
cellular recovery after the excessive backtracking associated with the 
SOS response. 

In the absence of cellular stress, UvrD is equimolar with RNAP 
(~3,000 molecules per cell*”). During the SOS response, however, the 
intracellular level of UvrD increases approximately threefold’’. This 
spike facilitates UvrD dimerization and helicase activity’* and probably 
is a prerequisite for UvrD-mediated backtracking. A sigmoid-shaped 
graph of RNAP arrest as a function of UvrD concentration (Extended 
Data Fig. 9) supports this view. After washing, UvrD remains bound to 
RNAP (Extended Data Fig. 1b), yet loses backtracking ability (Fig. 1a), 
suggesting monomeric and multimeric associations with RNAP during 
periods of no stress and stress, respectively (Fig. 5c). Excessive back- 
tracking can be detrimental to genomic integrity in cells recovering from 
genotoxic stress and resuming replication, as frequent co-directional 
collisions between the replisome and backtracked elongation complexes 
result in dsDNA breaks (DSBs)*’. By ‘pushing’ backtracked RNAPs 
forward, Mfd suppresses DSBs associated with such collisions®, and 
hence diminishes the high frequency of mutations associated with DSBs 
repair. This model is consistent with the reduced ‘mutation frequency 
decline’ phenotype of mfd cells as well as their high ultraviolet mut- 
ability, minimal ultraviolet sensitivity*°, and compromised transcrip- 
tional recovery after ultraviolet exposure’’. 

The process of TCR and the basic structural organization of RNAPs 
are evolutionarily preserved’*’®. Bulky DNA lesions, such as thymine 
dimers, stall bacterial and eukaryotic RNAPs similarly””*. As back- 
tracking is a fundamental feature of all RNAPs’’ and occurs perva- 
sively throughout the eukaryotic genome”, it is likely to drive TCR in 
higher organisms as well. Notably, mammalian 3’-5' DNA helicase, 
XPB, and its homologues from other eukaryotes, associates with elong- 
ating RNAP II as a subunit of the TFIIH transcription factor. It has 
been implicated in NER as well as in several human disorders assoc- 
iated with deficient DNA repair**”. It is thus possible that the mech- 
anistic role of XPB is analogous to that described here for UvrD, that is, 
backtrack-inducing TCR. 


METHODS SUMMARY 


Cultures of E. coli were crosslinked with formaldehyde, collected by centrifu- 
gation and lysed. RNAP-containing material was pulled down with anti-RNAP 
antibodies and proteins were identified by mass spectrometry. For DNA damage 
experiments, colony-forming units were counted after exposure to increasing con- 
centrations of 4-NQO, mitomycin C or cisplatin. For ultraviolet survival assays, 
diluted cultures were plated, irradiated with a ultraviolet lamp and incubated over- 
night in the dark. Strains were transformed with pUC18 plasmid and induced with 
IPTG before and after irradiation. In vitro transcription assays were performed as 
described’*. RNA-DNA scaffolds containing thymine dimers or 4-thio-dUMP- 
modified template oligonucleotides were assembled with immobilized, biotinylated 
RNAP before incubation with components of UvrABC system or irradiation at 
308 nM for protein-DNA crosslinking, respectively. Purified RNAP and UvrD 
were crosslinked with BS3, and crosslinked proteins were reduced with dithio- 
threitol (DTT), alkylated with iodoacetamide and digested with trypsin. Tryptic 
peptides were analysed with a mass spectrometer coupled to a liquid chromato- 
graphy system. pLink“° was used to search Mascot generic format files for inter- 
peptide crosslinks. For in situ DNA footprinting, the uvrD gene was cloned into a 
vector under the Py teto-1 promoter, and induced with anhydrotetracycline for 1h 
before CAA modification. CAA modifications on non-template DNA were analysed 
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with primer extension using [**P]-labelled primer in parallel with the sequencing 
reactions. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Bacterial strains. The strains used in this study are listed in Extended Data Table 1 
and were constructed using standard genetic techniques. 

Mass spectrometry of in vivo RNAP interactome. Mid-exponential cultures of 
E. coli K12 MG1655 were treated with formaldehyde (Thermo Fisher, final con- 
centration 50 mM) for 30 min, cells were collected by centrifugation, and lysed by 
a combination of ultrasound disruption and the action of lysozyme (human recom- 
binant, Sigma-Aldrich). Pull-down of RNAP-containing material was performed 
using a mix of anti-RNAP antibodies (Neoclone) immobilized on Protein A/G 
Mag Sepharose beads (GE). Protein identification in the pull-down material was 
carried out at the NYUMC Protein Core Mass Spectrometry Facility, using an 
LTQ-Orbitrap mass spectrometer followed by Mascot database search (Matrix 
Science). Protein abundance was estimated using emPAI scores'*. The range of 
emPAI values for a representative set of proteins in RNAP pull-downs spans 3 
independent experiments (Extended Data Fig. 1). 

Measuring sensitivity to DNA damage. E. coli strains were cultured in LB med- 
ium overnight. Appropriate dilutions were spread on LB agar plates containing 
increasing concentrations of 4NQO or NFZ or mitomycin C or cisplatin, followed 
by an overnight incubation at permissive temperatures. Colony-forming units (c.f.u.) 
were counted at each concentration of genotoxic agents. Chloramphenicol was used 
at the sublethal concentration (1.0 1g ml), along with genotoxic agents. A 10 mM 
stock solution of NFZ or 4NQO was first made in N,N-dimethylformamide, stored 
at —20°C, and diluted appropriately for each experiment. Mitomycin C was dis- 
solved in sterile saline to a concentration of 500 1g ml‘ and stored at —20°C. 
Cisplatin was dissolved in LB medium to a concentration of 500g ml just 
before use. For ultraviolet survival assays, overnight grown cultures were diluted 
in M9 minimal salts, appropriate dilutions were spread on LB agar plates and 
irradiated with a ultraviolet (254 nm) lamp and incubated overnight in the dark 
at appropriate temperatures. 

In vivo TCR. Strains MG1655Z1 and MG1655Z1 AgreA AgreB were transformed 
with pUC18 plasmid. Overnight cultures were diluted 1:100 in LB medium sup- 
plemented with 100 pg ml! ampicillin and grown until Dgoo nm ~0.3 at 30 °C. 
The culture was centrifuged at 5,000 r.p.m. for 5 min. The pellet was resuspended 
in M56 to an absorbance of 0.4. 

The resuspended culture was spread on a Petri dish and, whilst being shaken, 
irradiated with 70 J per m” of 254 nm ultraviolet light. To induce the transcription 
on pUC18, 1 mM isopropyl-B-p-thiogalactoside (IPTG) was added to the culture 
medium 1h before and immediately after ultraviolet irradiation. The irradiated 
culture was then supplemented with 0.25% casamino acids, 0.004% thiamine and 
0.4% glucose and incubated at 30 °C in the dark to avoid removal of cyclobutane 
dimers by photoreactivation. Cells were collected before ultraviolet irradiation 
(-), immediately after (time 0 repair) and at defined repair times (15, 30, 45 and 
60 min). Plasmid DNA was isolated from the cultures using a Qiagen plasmid 
isolation kit. 

Primer extension was performed with [**P]-labelled primers for analysis of 
cyclobutane pyrimidine dimers on the template strand. End labelled primer, 
50 pmol, (5’-GGCATGCAAGCTTGGCACTGGC-3’), 400 uM dNTPs, 200 ng 
of plasmid DNA and Taq DNA polymerase were mixed in Thermopol buffer. 
The mixture was denatured by heating to 98 °C in a thermal cycler for 5 min 
followed by annealing at 55°C for 5 min. Primer extension was performed at 
68 °C for 6 min. Formamide stop solution was added to the reaction mixture and 
aliquots loaded onto a 6% urea-polyacrylamide gel. Percentages of repaired DNA 
were calculated using Image-Quant (GE) and averaged from three independent 
experiments with untreated samples taken as 100%. 

In vitro transcription assays. E. coli RNAP and UvrD were purified as described 
in refs 16 and 48, respectively. DNA templates were constructed using the non- 
transcribed part of T7 Al promoter fragments fused to appropriate transcription 
units. Resulting DNA fragments were PCR amplified and purified from an agarose 
gel. For biotinylated DNA templates, biotinylated DNA oligonucleotides (IDT) 
were used. All NTP substrates were purified by ion-exchange chromatography’®. 
Transcription at the T7A1-promoter templates was initiated by mixing 1 pmol of 
RNAP with equimolar amount of appropriate DNA PCR fragments in TB50 (TB, 
transcription buffer) (40mM Tris-HCl pH 8.0; 10mM MgCl; 50mM NaCl) 
followed by addition of 104M ApUpC RNA primer, 25M ATP and GTP. 
Incubation was continued for 5 min at 37 °C. Resulting complexes were labelled 
by addition of 2 Cu a-[°*P]-CTP (3,000 Cimmol ’; MP Biomedicals) for 5 min 
at room temperature and immobilized on Neutravidin beads (5-10 1]; Piers) in the 
presence of 1.5mg ml * heparin. The resulting EC20 was washed twice with 1 ml 
of TB1000 (TB with 1 M NaCl), twice with 1 ml of TB100 and divided into 10 pl 
aliquots. For the chase experiments, the EC20 was incubated with either the indi- 
cated amounts of UvrD or equal volumes of mock buffer for 5 min at room tem- 
perature and then chased for 5 min at 37 °C by addition of 0.1 mM NTPs. Where 
indicated, beads were washed after the chase with 1 ml of TB1000 and 1 ml of 


TB100. All reactions were stopped by equal volume of stop buffer (SB, 1X TBE 
buffer, 8 M urea, 20 mM EDTA, 0.025% xylene cyanol, 0.025% bromophenol blue). 
For the GreB cleavage assay (Fig. 2a), two 10 il aliquots were incubated for 5 min at 
room temperature with 114M GreB. One aliquot was then chased for 5 min by 
addition of 0.1 mM NTPs and both samples were quenched as above. To test the 
effect of NusA on UvrD-induced backtracking (Fig. 5a), the initial EC20 was 
prepared as above and incubated with the indicated amounts of UvrD in the 
absence or presence of 10nM NusA. Reaction mixtures were chased as above. 
For the UvrD-induced formation of stalled ECs (Fig. 2b), EC20 was walked to 
position 36-39 as described in ref. 16. The resulting ECs were mixed with 12.5 nM 
UvrD and 1 mM ATP or an equal amount of mock solution and incubated at 37 °C 
for the indicated time intervals. Next, 10 pl aliquots were withdrawn and mixed 
with 0.1 mM GTP and CTP for 5 min at room temperature. Reactions were stopped 
by addition of equal volumes of SB. The amount of UvrABC-mediated DNA 
cleavage products after 30 min at 37 °C was taken as 100% and UvrABC-mediated 
cleavage after CTP + GTP chase was calculated as a fraction of that value. Image- 
Quant (GE) software was used for all quantifications. 

Reconstitution of the EC and UvrABC-mediated processing of thymine 
dimers. For assembly of the RNA-DNA scaffold (Fig. 2a), 75 pmol Al RNA 
(IDT) were mixed with 75 pmol of the cyclothymine dimer template DNA strand 
or control strand (Midland) in 20 ul of the annealing buffer (AB) (12% glycerol; 
20 mM Tris pH 8.0; 40 mM KCl; 5 mM MgCl) and annealed gradually in a PCR 
cycler. The resulting scaffold was stored at —20°C and used as needed. To mea- 
sure the extent of UvrABC-directed cleavage, 15 pmol of the DNA-RNA scaffold 
were mixed with 15 pmol of biotinylated RNAP in 20 ull of AB and incubated for 
10 min at room temperature. 75 pmol of non-template strand was added and 
incubation continued for another 5 min. The resulting complex was immobilized 
at Neutravidin-coated beads (Pierce), washed twice with TB1000 and then with 
TB100. For RNA labelling, 5 uM ATP and 2 Cu «-[*’P]-CTP were added for 
5 min at room temperature, followed by washing with TB100. For the UvrC 
cleavage reaction complexes were washed twice with TBO (20mM Tris-HCl 
pH 8.0; 10 mM MgCl.) and 10X PNK buffer (New England Biolabs) was added 
together with 1 Cu y-[°P]-ATP (7,000 Cimmol '; MP Biomedicals) and 20 
units PNK (New England Biolabs). The DNA labelling reaction continued for 
30 min at 37°C, then the beads (EC11) were washed twice with TB1000 and 
TB100. EC11 were processed directly or chased first for 10 min at 37 °C by the 
addition of 1mM NTPs. Complexes were supplemented with 1mM ATP and 
12.5nM UvrD was added where indicated for 10 min at 37°C before washing 
with TB100. To initiate the cleavage reaction premixed UvrA (2.5nM), UvrB 
(10nM) and UvrC (25 nM) were added and incubated for the indicated time 
intervals at 37 °C before quenching with SB. UvrABC were purified as described 
in ref. 49. Products of all reactions were separated in 6-23% denaturing poly- 
acrylamide gels and visualized via phosphor-imaging using a Typhoon phosphor- 
imager (GE Healthcare) and Image Quant software (GE Healthcare). 
Protein-DNA crosslinking. RNA-DNA scaffolds were assembled as above 
using the 4-thio-dUMP-modified template oligos (Midland), Al RNA (9 nt long) 
and non-template DNA. Resulting complexes were labelled as described above 
and incubated with 12.5nM UvrD or mock solution for 5 min at 37 °C before 
being irradiated for 10 min with a hand-held ultraviolet lamp (Cole Parmer) at 
308 nM wavelength. Irradiation was performed on ice and samples were dena- 
tured in Laemmli loading buffer. Products were separated in a 6% denaturing 
polyacrylamide gel with 0.1% SDS and Tris-glycine running buffer. UvrD adducts 
were seen on the gel as a 90 kDa species. 

UvrD-RNAP binding assay. UvrD and RNAP, UvrD alone, and RNAP alone 
were incubated at room temperature for 30 min in binding buffer (50 mM Tris 
pH 8.0, 100 mM NaCl, 50 mM imidazole) before they were added to His Mag 
Sepharose Ni beads (GE), which were pre-equilibrated and washed with binding 
buffer. Each was then incubated for 30 min with gentle agitation at room tem- 
perature. After incubation, beads were washed three times with binding buffer 
and eluted in 50 ll of elution buffer (50 mM Tris pH 8.0, 100 mM NaCl, 400 mM 
imidazole). Samples were electrophoresed in an SDS-polyacrylamide gel along- 
side pure UvrD, for reference. 

RNAP-UvrD crosslinking and their mapping by LC-MS/MS. Purified RNAP at 
a final concentration of 2 [1M was incubated with 4 1M UvrD in crosslinking buffer 
(1X PBS, 500mM NaCl, pH7.4) for 30 min at room temperature. Crosslinking 
was initiated upon addition of Bis(sulfosuccinimidyl) suberate (BS3) (Thermo 
Scientific) in 1XPBS at a final concentration of 350 uM. Crosslinking reactions 
continued for 30 min at room temperature before they were quenched with 1M 
Tris pH 8.0 at a final concentration of 200 mM. 

Crosslinked proteins were separated by SDS-PAGE (4-12% Bis Tris Novex gels, 
Invitrogen) and stained with GelCode Blue. Super-shifted (relative to positions of 
untreated RNAP subunits) were removed from the gel and destained. Samples 
were reduced with 20mM DTT (Sigma) at 57 °C for 60 min and alkylated with 
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45 mM iodoacetamide (Sigma) in the dark for 45 min at room temperature before 
overnight, in-gel digestion with 0.5\1g sequencing grade modified trypsin 
(Promega). 

Gel slices were incubated with Poros R2 50 jm slurry diluted in 5% formic acid 
and 0.2% TFA to bind tryptic peptides. The Poros bead slurry was collected and 
washed through C18 Zip Tips (Millipore) before elution with 40 ll of 40% acet- 
onitrile in 0.5% acetic acid. Eluted peptides were dehydrated in vacuum and 
resuspended in 40 il 0.5% acetic acid for MS analysis. 

Peptide aliquots (~200 ng) were analysed in the Q Exactive mass spectrometer 
(Thermo Scientific) coupled to an EASY-nLC (Thermo Scientific) liquid chro- 
matography system. The peptides were eluted over a 120-min linear gradient 
from 98% buffer A (water) to 40% buffer B (ACN) then continue to 100% buffer 
B over 10 min with a flow range of 200 nl min™ | Each full MS scan (R = 70,000) 
was followed by dd-MS? (R= 17,500) with HCD and an isolation window of 
2.0 m/z. Normalized collision energy was set to 35. Precursors of +1, +2 and +3 
were excluded from MS2 scans; monoisotopic screening was enabled and a 
dynamic exclusion window was set to 30.0s. 

For identification of crosslinks, mascot generic format (mgf) files were gener- 
ated from Xcalibur raw files using the Proteome Discoverer software (Thermo 
Scientific). pLink*® was used to search resulting megf files for interpeptide cross- 
links with the following settings: precusor mass tolerance 20 p.p.m.; fragment 
mass tolerance 5 p.p.m.; BS3 crosslinker (crosslink monoisotopic mass shift of 
138.0680786 Da, monolink mass shift of 156.0786442 Da). One fixed modification 
(Carbamidomethyl-C) and three variable modifications (oxidation-M, N-terminal 
glutamate to pyroglutamate, N-terminal acetylation) were specified. False discov- 
ery rate was set at <5%. Fragmentation spectra for each crosslinked peptide were 
validated using pLabel”® and confirmed with manual inspection. 

In situ DNA footprinting. To construct the plasmid pUvrD (Cm’, p15a), the 
uvrD gene was excised from the plasmid pETDuet-uvrD as an Xhol / filled-in 
Xbal fragment and cloned into the pZA31 vector at the SalI site (compatible 
cohesive end with Xhol) and the blunted KpnI site. The expression of the uvrD 
gene in the resulting plasmid is driven by the P,-derivative promoter Piteo-1 
which is controlled by the operator repressor system of the Tn10-derived tet 
resistance operon”'. The induction of Pyteto-1 is achieved by anhydrotetracycline. 
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To ensure expression of the regulatory repressor proteins LacI and TetR, the 
entire unit encoding LacR, TetR and Sp’? was transferred into the chromosome 
of E. coli strain MG1655 by P1 transduction (MG1655Z1). 

E. coli strains were grown in 10 ml of M9 medium supplemented with 0.4% 
glucose, 4mg ml ' casamino acids, 100 1g ml‘ ampicillin and 30 pg ml’ chlor- 
amphenicol. To induce the Pyteto-1 promoter, anhydrotetracycline (50 ng ml‘) 
was added for 1h before CAA modification. At Déo9 nm ~ 0-6, 1 mM IPTG was 
added (if required) for 10 min, then CAA was added to 4% final concentration 
and the incubation was continued for 7 min. Then cells were collected, immedi- 
ately washed with the saline solution, collected and the plasmid DNA extracted. 

To analyse CAA modification’ of the non-template DNA strand, primer 
extension was performed with [°?P]-labelled primer (5'-TAGCTTCCTTAGCT 
CCTGA-3’) in parallel with the sequencing reactions in a cycler at the following 
conditions: 94 °C for 3 min (initial denaturation), then 94 °C for 30 s (denaturation), 
56 °C for 30s (annealing), 72°C for 60s (extension), 25 cycles total. Formamide 
stop solution was added to the reaction mixture and aliquots were analysed by 
electrophoresis in an 8% urea-polyacrylamide gel. 
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A Be-type star with a black-hole companion 


J. Casares), I. Negueruela’, M. Rib6*, I. Ribas®, J. M. Paredes*, A. Herrero’? & S. Simon-Diaz!? 


Stellar-mass black holes have all been discovered through X-ray 
emission, which arises from the accretion of gas from their binary 
companions (this gas is either stripped from low-mass stars or sup- 
plied as winds from massive ones). Binary evolution models also pre- 
dict the existence of black holes accreting from the equatorial envelope 
of rapidly spinning Be-type stars’ (stars of the Be type are hot blue 
irregular variables showing characteristic spectral emission lines of 
hydrogen). Of the approximately 80 Be X-ray binaries known in the 
Galaxy, however, only pulsating neutron stars have been found as 
companions’ ~*. A black hole was formally allowed as a solution for 
the companion to the Be star MWC 656 (ref. 5; also known as HD 

215227), although that conclusion was based on a single radial 
velocity curve of the Be star, a mistaken spectral classification® and 
rough estimates of the inclination angle. Here we report observa- 
tions of an accretion disk line mirroring the orbit of MWC 656. 
This, together with an improved radial velocity curve of the Be star 
through fitting sharp Fe 1 profiles from the equatorial disk, and a 
refined Be classification (to that of a B1.5-B2 III star), indicates that 
a black hole of 3.8 to 6.9 solar masses orbits MWC 656, the candidate 
counterpart of the y-ray source AGLJ2241+4454 (refs 5, 6). The 
black hole is X-ray quiescent and fed by a radiatively inefficient 
accretion flow giving a luminosity less than 1.6 X 10” times the 
Eddington luminosity. This implies that Be binaries with black-hole 
companions are difficult to detect in conventional X-ray surveys. 

The majority of Be X-ray binaries’ (BeXBs) contain proven neutron 
stars and are characterized by their transient changes in X-ray lumin- 
osity, when episodes of increased accretion onto the compact star are 
modulated either by the periastron passage or tidal disruption of the Be 
circumstellar disk*. When in quiescence, they have very low (or even 
undetectable) X-ray emission. It has been proposed that BeXBs with 
black holes (Be-BH) are difficult to find because of efficient disk trun- 
cation, leading to very long quiescent states’. Alternatively, their absence 
could be driven by binary evolution, with Be-BH binaries having a lower 
probability of being formed and surviving a common envelope phase’. 

MWC 656 is a Be star located within the error box of the point-like 
y-ray source AGL J2241+4454 (ref. 9). A photometric modulation of 
60.37 + 0.04 days was reported, suggesting that MWC 656 is a mem- 
ber of a binary’; this was subsequently confirmed through radial velo- 
cities of Het lines from the photosphere of the Be star’. The radial 
velocity curve, though, displays considerable scatter, likely to be caused 
by filled-in emission from the circumstellar wind contaminating the 
broad absorption profiles (a common limitation in the analysis of BeXBs; 
see, for example, ref. 10) and only a tentative orbital solution is available’. 
Here we revisit the 32 Liverpool telescope spectra previously reported’. 
These are complemented with 4 additional Liverpool telescope spectra 
anda further high-resolution echelle spectrum obtained with the 1.2-m 
Mercator telescope (see Methods and Extended Data Table 1). 

A close-up of the Mercator telescope spectrum is presented in Fig. 1, 
showing classic Fe 11 emission lines from the Be circumstellar disk. In 
addition, a He ll 4,686 A emission line (which was overlooked in a pre- 
vious work®) stands out clearly: its presence is remarkable because it 
requires temperatures hotter than can be achieved in disks around 


B-type stars. Further, the He 1 profile is double-peaked, which is the 
signature of gas orbiting in a Keplerian geometry"’. Gaussian fits to the 
He profiles in the Liverpool telescope spectra reveal that the centroid 
of the line is modulated with the 60.37-day orbital period, reaching 
maximum velocity at photometric phase 0.06 (see Methods and Extended 
Data Fig. 1). This is approximately in antiphase with the radial velocity 
curve of the Be star’, a strong indication that the He 11 emission arises 
from gas in an accretion disk around the invisible companion and not 
from the Be disk. We can therefore use its radial velocity curve to trace 
the orbit of the Be companion. An eccentric orbital fit to the He ! velo- 
cities was performed using the Spectroscopic Binary Orbit Program 
(SBOP”), fixing the period to 60.37 days (Methods); the resulting orbital 
elements are given in Extended Data Table 2. The orbital evolution of 
the Hei line is presented in Fig. 2. The line flux is also found to be 
modulated with the orbital period (Methods and Extended Data Fig. 1), 
owing to the presence of an S-wave component swinging between the 
double peak (see Fig. 2). 

To improve on the radial velocity curve of the Be star previous! 
reported’, we fitted the sharp double-peaked profile of the Fe 11 4,583 A 
emission line with a two-Gaussian model (Methods). Felt lines are 
known to arise from the innermost regions of the circumstellar disk’*"*, 
and therefore reflect the motion of the Be star much more accurately 
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Figure 1 | Optical spectrum of MWC 656, obtained with the Mercator 
telescope. The spectrum has been rebinned to 0.1 A per pixel and smoothed 
through a 3-pixel Gaussian bandpass. The emission lines Fe 11 4,583 A (of 
multiplet 38) and He 11 4,686 A are indicated. Whereas Fel is formed in the Be 
circumstellar disk, He 1! arises from gas encircling the companion star. Several 
other circumstellar Fe 1 lines are detected in the spectrum (for example, 4,549, 
4,555, 4,629, 4,666, 4,731 A) but they are weaker and/or severely blended with 
other emission lines and photospheric absorptions. 
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Figure 2 | Orbital evolution of the Fe 11 4,583 A and Het 4,686 A emission 
lines. a, Sequence of spectra folded into 15 phase bins using the photometric 
ephemeris®. ¢ indicates the binary orbital phase. Vertical dashed lines indicate 
the central wavelength of each line. For the sake of clarity, the flux of the Fem 
line has been arbitrarily scaled by a factor of 2 with respect to Hem. The Het 
double peak is distorted by an S-wave component, typically associated with a 
bright spot or asymmetry in the outer accretion disk”. b, Trailed intensity 
image of the two lines constructed from the phase binned spectra. Two orbital 
cycles are displayed for clarity. The colour scale indicates counts normalized to 
the continuum, with the black colour corresponding to 0.98 and the white 
colour to 1.08 in Fe! and 1.16 in He. 


than the broad He 1 absorptions. The Fe 1! velocities were also modelled 
with SBOP, fixing the period to 60.37 days. The eccentric orbital fit 
results in orbital elements that are consistent with the Fe 1 orbit being 
the reflex of the He 1! orbit, as expected from the motion of two com- 
ponents in a binary system (see Methods and Extended Data Table 2). 
Only the eccentricities are slightly discrepant, but just at the 1.6a level. 

Consequently, we modelled the ensemble of Fe! and Hew! radial 
velocities with a double-line eccentric binary orbit in SBOP. Figure 3 
presents the radial velocity curves of the two emission lines with the best 
combined solution superimposed. The resulting orbital elements are listed 
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Figure 3 | The radial velocity curves of the Be star and its companion. Filled 
circles indicate radial velocities of the Be star, as obtained from the Fe 1 4,583 A 
line; open circles indicate those of the companion star, extracted from the 

He 1 4,686 A line. Error bars, 1 s.d. The best combined orbital fit is overplotted. 
The velocities have been folded with the photometric ephemeris’ as in Fig. 2. 
A shift of +30 kms | was applied to the He! velocities to compensate for the 
S-wave contamination (Methods and Extended Data Fig. 2). Two Fe 1! velocities 
(not shown) were found to deviate by >2c¢ (where ar is the r.m.s. of the fit) 
from the best solution and were excluded from the fit, but they have a negligible 
impact on the final orbital elements. 


and Extended Data Fig. 3). Using excitation temperature diagnostics 
based on several absorption line ratios, we determine a spectral type of 
B1.5-B2, and the strength of the metallic lines combined with the 
moderate width of the Balmer absorption wings implies luminosity 
class III (Methods). Our adopted B1.5-B2 III classification implies a 
mass of 10-16 solar masses (Mo) for the Be star (Methods), and hence 
a companion star of 3.8-6.9Mo. 

The large dynamical mass of the companion to the Be star MWC 656 
is puzzling. A normal main-sequence star with such mass would have a 
spectral type in the range B3-B9 and its spectrum would be easily 
detected in the optical range. Nor can it be a subdwarf, because these 
typically have masses in the range 0.8-1.3 Mo (ref. 15). The stripped 
He core of a massive progenitor (that is, a Wolf-Rayet star) is also 
rejected because it possesses strong winds which show up through 
intense high-excitation emission lines, not present in our spectra. In 


Table 1 | Orbital elements for MWC 656 


Parameter Value 
Porb (days) 60.37 (fixed) 
To (HJD — 2,450,000) 3,243.70 + 4.30 
e 0.10 + 0.04 
@ (degrees) 163.0 + 25.6 
y(kms +) -141+2.1 
K, (kms 1) 32.0453 
Kz (kms~1) 78.1 43.2 
aysini (Ro) 38.0 + 6.3 
agsini (Ro) 92.8+3.8 
Mysin?i (Mo) 5.83 +0.70 
Mosin3i (Mo) 2.39 +048 
Mo/M, 0.41 + 0.07 
a (kms_?) 16.7 


in Table 1. Our solution yields a mass ratio q = M2/M, = 0.41 + 0.07, 
which implies a rather massive companion star. A precise determina- 
tion of the companion’s mass requires an accurate spectral classifica- 
tion of the Be star. Accordingly, we have compared the Mercator telescope 
spectrum with a collection of observed B-type templates, broadened by 
330kms_ | to mimic the large rotation velocity in MWC 656 (Methods 


The solution was obtained from a combined fit to the radial velocity curves of the He 11 4,686 Aand Feu 
4,583 A lines. The orbital period P.,, has been fixed to the photometric value®. To is the epoch of 
periastron (where HJD refers to heliocentric Julian date), e the orbit eccentricity, w the longitude of 
periastron, y the systemic velocity, K the velocity semiamplitude, a the semimajor axis, i the binary 
inclination, M the stellar mass and o; the r.m.s. of the fit. Subscripts 1 and 2 refer to the primary (Be star) 
and secondary (companion) components, respectively. To implies that periastron passage occurs at 
photometric phase 0.01 + 0.10. 
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addition, this should be detected as an ultraviolet excess in the spectral 
energy distribution, which is not observed in the fluxes available in the 
literature (Methods). On the other hand, the evidence of a He Il accre- 
tion disk encircling the companion star strongly points towards the 
presence of a compact object. The large dynamical mass rules out a 
white dwarf or a neutron star, so the only viable alternative is a black 
hole. It should be noted that none of the ~170 BeXBs currently known* 
shows any evidence for an accretion disk, providing circumstantial 
evidence for a difference in the nature of the compact stars. The accre- 
tion disk in MWC 656 is expected to also radiate Balmer and He! lines 
but these are blended with the corresponding (stronger) emission lines 
from the Be disk and thus are not detected. 

MWC 656 is a key system in the study of BeXBs and massive binary 
evolution. At a distance d= 2.6 + 0.6kpc (Methods) it is relatively 
nearby and also one of the visually brightest Be binaries’. It thus seems 
reasonable to assume that many other Be-BHs exist in the Galaxy but 
remain hidden by the lack of transient X-ray activity. Analysis of 
archival ROSAT images yields an upper limit to the X-ray flux at 
energies of 0.1-2.4 keV of 1.2 x 10‘ ergcm *s ' (Methods) which, 
for our estimated distance, translates into an X-ray luminosity 
Ix <1.0X 10° ergs”' or <1.6 X 107” times the Eddington lumin- 
osity, Lgaa. Therefore, accretion is highly inefficient in MWC 656, akin 
to accretion onto black holes in quiescent low-mass X-ray binaries’®, 
where accretion disks are truncated at ~10°-10* Schwarzschild radii 
and then behave as an advection dominated accretion flow’”. 

In the context of disk instability theory, the very low mass-transfer 
rates expected for BeXBs (with peak values of ~10'’ Mo yr ‘ near 
periastron) lead to extremely long outburst recurrence periods or even 
to completely suppressed transient activity’®. It is the dormant con- 
dition of the accretion disk together with the absence of a solid surface 
reradiating the accretion energy that makes Be-BHs very difficult to 
detect through X-ray surveys, thus providing an explanation for the 
missing Be-BH population. This is in stark contrast with the other 
black-hole high-mass X-ray binary known in the Galaxy, Cygnus X-1, 
where an X-ray-persistent accretion disk is fed by the powerful wind 
(~10 *Moyr_') of an O supergiant star’. 

The detection of a Be-BH is also important for our understanding of 
BeXB evolution. Whereas the total number of neutron-star BeXBs in 
the Galaxy depends strongly on the distribution of kick velocities, the 
number of Be-BHs is very sensitive to the survival probability during 
the common envelope phase’. Modern population synthesis models pre- 
dict a Galactic number ratio of neutron-star to black-hole BeXBs of 54, 
for the case of no common envelope survival during the Hertzsprung 
gap and a Maxwellian distribution of kick velocities with reduced root 
mean square (r.m.s.) ¢ = 133km s + (model C in ref, 3). There are 
currently ~81 BeXBs known in the Galaxy with ~48 pulsating neutron 
stars*”°, and thus our discovery ofa black-hole companion to MWC 656 
is consistent with these model predictions. However, it should be noted 
that the X-ray spectra of the remaining BeXBs, whenever they are 
available, also indicate the presence of a neutron star. Further, in stark 
contrast with the known BeXBs, MWC 656 has been identified through 
a claimed y-ray flare (see Methods) and not by its X-ray activity. This 
seems to imply that the discovery of Be-BHs is observationally biased, 
in which case common envelope mergers would be less frequent than 
commonly assumed and/or neutron star kicks would be best described 
by the radio pulsar birth velocity distribution’. Last, it is interesting to 
note that MWC 656 will probably evolve into a black-hole/neutron-star 
binary, a potential source of strong gravitational waves and a short y-ray 
burst (Methods). 


METHODS SUMMARY 


The Fe 11 line was fitted with a two-Gaussian model with Gaussian positions, intens- 
ities and separation left as free parameters. The Fe 11 velocities were obtained from 
the mean of the Gaussians offset with respect to the rest velocity at 4,583.837 A. A 
detailed radial velocity study of the He profile was performed using the double- 
Gaussian technique”', and this shows that the systemic velocity is pushed down in 
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the line core by the S-wave component while the phasing and velocity semiampli- 
tude remain very stable. Therefore, a +30kms | offset was applied to the Hen 
velocities before fitting all the data points with a double-line orbital model. The 
S-wave also modulates the He ui flux with the orbital period, and its phasing can be 
interpreted as either enhanced mass transfer during periastron or the visibility of a 
hotspot in the outer accretion disk. The spectral classification of the star was 
obtained by direct comparison with a range of templates conveniently broadened, 
and the best match is provided by the B1.5 III star HD 214993. We used several 
calibrations available in the literature to constrain the mass of MWC 656 from its 
spectral type, including evolutionary tracks, a dynamical determination from the 
detached eclipsing binary V380 Cyg and robust lower limits from dynamical 
masses of main-sequence stars. The distance is obtained through combining the 
absolute magnitude of B1.5-2 III stars with the observed brightness of MWC 656, 
corrected for interstellar reddening. Upper limits to the X-ray flux of MWC 656 are 
derived from archival ROSAT and Swift pointings, using a neutral hydrogen 
column density Ny = 1.4 X 107! cm? anda photon index I’ = 2.0, typical of black 
holes in quiescence. We further discuss the future evolution of MWC 656 and its 
fate, a possible black-hole/neutron-star binary. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Spectroscopic observations. Thirty-two 10-min spectra of MWC 656 were obtained 
between 23 April and 28 July 2011 using the Fibre-fed RObotic Dual-beam Optical 
Spectrograph (FRODOspec) on the robotic 2.0-m Liverpool telescope at the 
Observatorio del Roque de Los Muchachos on La Palma (Spain). Full details of 
these observations are given elsewhere’. Four additional FRODOspec spectra were 
collected on the nights of 28 May and 2-4 June 2012 using an identical instrument 
configuration. We also used the High Efficiency and Resolution Mercator Echelle 
Spectrograph (HERMES) on the 1.2-m Mercator telescope to obtain a 15-min 
spectrum on the night of 26 October 2012. We employed the high-resolution mode 
in HERMES which yields a resolving power R = 85,000 across the entire optical 
range 3,770-9,000 A. A series of B-type MK standards (where MK refers to the 
Morgan-Keenan spectral classification) were also observed with HERMES on the 
nights of 14 June and 9 November 2011 using an identical set-up as for MWC 656. 
The automatic pipeline products were used for the extraction and calibration of the 
Liverpool telescope and Mercator telescope data. A full log of the observations is 
presented in Extended Data Table 1. 

Radial velocity analysis. Radial velocities were extracted from the Hei 4,686 A 
emission by fitting a single Gaussian to the line profile in the 36 Liverpool telescope 
spectra. A least-squares sine fit to the velocity points yields an orbital period of 
59.5 + 0.6 days (Extended Data Fig. 1a), which is consistent within 1.40 with the 
(more accurate) photometric period determination®. The difference is also explained 
by the fact that our radial velocities only cover two full orbital cycles. Consequently, 
we henceforth adopt 60.37 + 0.04 days as the true orbital period of the binary. The 
radial velocities were modelled with an eccentric binary orbit using the code 
SBOP”, with the period fixed to 60.37 days. Individual points were weighted 
proportionally to 1/o”, where a is the radial velocity uncertainty. We adjusted 
the following orbital parameters: epoch of periastron (To), eccentricity (e), argu- 
ment of the periastron (@), systemic velocity (y) and velocity semiamplitude (K). 
The resulting orbital elements are listed in the first column of Extended Data Table 
2, together with their implied fundamental binary parameters. The fitted solution 
yields maximum velocity at phase 0.06, where phase 0 is arbitrarily set to HJD 
2453243.3 or the epoch of maximum brightness in the photometric light curve’. 
This is approximately in antiphase with the radial velocity of the Be star previously 
determined’, and therefore the He 1 emission most probably follows the orbit of 
the companion star. 

To further constrain the orbital motion of the Be star, we measured radial 
velocities from the Fer 4,583 A emission line by fitting a two-Gaussian model 
to the individual profiles. The Gaussians are set to have identical widths but their 
intensities and separation are allowed to vary to account for possible profile changes. 
In any case, these three free parameters are found to be very stable, with differences 
which are always within 10%. Only in one case was the double-peak separation 
found to be 13% smaller than the average. This corresponds to one spectrum where 
the profile appears blurred. For this particular case we decided to fix the Gaussian 
separation to the average value, 270kms_'. The radial velocity of the Fe 1 line was 
then obtained from the centre of the double Gaussian model relative to the line rest 
velocity at 4,583.837.A. We subsequently fitted the radial velocity points with an 
eccentric orbit model using SBOP after fixing the orbital period to 60.37 days and 
weighting data points proportionally to 1/0, as before. The best set of fitting 
parameters are listed in the second column of Extended Data Table 2. 

The two independent solutions presented in Extended Data Table 2 give peri- 
astron phases Ty which are consistent at 1.0c. Also, the arguments of periastron are 
in antiphase within 1.00 uncertainties, that is, @per] = Hen + 180°. Further, the 
eccentricities are consistent at the 1.6a level. These are all strong indications that 
both radial velocity curves display the reflex motion of two components in a binary 
system, thus endorsing fitting a combined double orbit to the ensemble of 72 He 11 
and Fe n data points. In any case, the radial velocity amplitudes of the single line orbits 
are found to be very robust and in excellent agreement with the combined double- 
lined solution presented in the main text. These are the key parameters constrain- 
ing the binary mass ratio and thus the nature of the companion to the Be star. 
The diagnostic diagram. Extended Data Table 2 shows a 30kms  ' difference 
between the systemic velocities of the Fe 11 and He solutions, which we interpret 
as due to contamination by the S-wave component seen in the core of the Hell 
profile”. To test this, we measured radial velocities from different velocity bands of 
the Het line profile, using the double-Gaussian technique”’. To further increase 
the signal-to-noise ratio, we averaged the 36 Liverpool telescope spectra into 15 
phase bins, relative to the photometric ephemeris. Every phase-binned spectrum 
was convolved with a two-Gaussian passband where the Gaussian separation was 
varied in the range a = 200-600kms_' in steps of 50kms'. Gaussian separa- 
tions a<200kms_' produce radial velocities which are corrupted by the variable 
shape of the double-peaked profile, while for a > 600 kms! the line flux becomes 
too weak. Each radial velocity curve was subsequently fitted with the expression 
V(g) = y + Ksin[21(g — @o)], where g is the binary phase, and the evolution of the 


fitting parameters K, y and gp versus the Gaussian separation a is displayed in 
Extended Data Fig. 2 as a diagnostic diagram”. Note that we prefer to fit a simple 
sine wave model rather than a full eccentric orbit because the extra fitting para- 
meters (that is, e, w and periastron phase) become poorly constrained as we 
approach the noisy wings of the line profile. Given the small orbital eccentricity 
the fitted sine wave parameters are still meaningful, although they should be taken 
as a first approximation to the true values because of the model oversimplification. 

The high-velocity wings of the He 11 profile are formed in the inner regions of the 

accretion disk and are less affected by the core S-wave component. Therefore, they 
are expected to closely trace the motion of the compact star. Extended Data Fig. 2 
shows that, as we move away from the line core, the systemic velocity rises quickly 
from about —46kms_' to about —25kms°', thus approaching the value of the 
Fe solution. This demonstrates that the Fe 11 velocities provide a more accurate 
description of the true binary systemic velocity than the centroid of the Het 
line and, therefore, we decide to apply a +30kms offset to the latter. Beyond 
a>500kms! the continuum noise begins to dominate the radial velocities, as 
indicated by the steeper rise in the control parameter o(K)/K (ref. 23), and thus the 
fitted parameters are less reliable. The diagram also illustrates that both phasing 
and velocity semiamplitude are very stable in the interval a = 200-500kms'', 
ranging between @y = 0.77-0.82 and K = 76-92kms_'. Beyond a>500kms ' 
K drops below 70kms_' and hence there is a possibility that the velocity semi- 
amplitude of the compact star was overestimated in Table 1. Note, however, that 
this would raise the binary mass ratio which would make even stronger the 
argument for a black-hole companion. 
Orbital modulation of the He II flux. Extended Data Fig. 1b shows that the 
equivalent width of the He 4,686 A line is strongly modulated with the orbital 
period. It peaks at phase ~0.9, which is very close to the maximum in the pho- 
tometric light curve (that is, phase 0 by convention). On the other hand, the 
amplitude of the equivalent width modulation is an order of magnitude larger 
(~40%) than that of the photometric light curve®*. These two arguments imply 
that the equivalent width variability is driven by true changes in the line flux rather 
than in the continuum. 

Extended Data Fig. 1 also reveals that the maximum equivalent width (phase 

0.93 + 0.04) almost coincides with the peak in the He 1! radial velocity curve, when 
the compact star is receding from us at maximum speed. The latter agrees well with 
the time of maximum visibility of the hotspot or shock region between the gas 
stream and the accretion disk. Alternatively, the modulation of the He mt flux can be 
interpreted as caused by enhanced mass transfer near periastron, which is con- 
strained to phase 0.01 + 0.10 by our orbital solution. 
Spectral classification, mass and distance to MWC 656. The spectral classifica- 
tion of MWC 656 is complicated by the effects of fast rotation, and by the presence 
of many (mostly Fe1!) emission lines affecting many of the features useful for 
this purpose. To provide an improved classification, our high-quality Mercator 
telescope+HERMES spectrum was compared to the spectra of several MK stan- 
dards taken with the same instrumentation and set-up as for MWC 656. Before 
this, we measured the rotational broadening (vsini) in MWC 656 by applying the 
Fourier technique’, combined with a goodness-of-fit method’’, to the Het 
absorption profiles (4,387 A, 4,471 A and 4,922 A) and the Sim line 4,552 A. 
This technique allows us to disentangle the different contributions to the line 
broadening, with the first zeroes of the transformed profile due to rotation. We 
obtain vsini = 330 + 30kms_!, with the error reflecting the dispersion of the 
individual lines, in good agreement with the value previously reported’. All our 
MK standards have very small intrinsic broadenings”’, and thus they were subse- 
quently broadened by 330kms ' to reproduce the observed broadening in 
MWC656. This was done by convolving the MK spectra with a Gray rotational 
profile”, using a limb-darkening coefficient ¢ = 0.34 which is appropriate for the 
stellar parameters of our target (see below) and the spectral range of interest. 

The narrow wings of the Balmer lines definitely indicate that the star is a giant, 
while the strength of the O 11 spectrum makes it B2 or earlier, in contrast with a 
previous classification® which reported a B3 IV. The absence of Sitv and Het 
absorption lines places MWC 656 in the B1-B2 range. As shown in Extended 
Data Fig. 3, the best match to the overall spectrum is provided by the B1.5 III 
standard HD 214993, although MWC 656 has rather stronger N 1 features. Nitrogen 
enhancement is frequently found in fast rotators*”’, and understood as a natural 
product of stellar evolution®®. Nitrogen enhancement is often accompanied by C 
depletion. Therefore, we cannot completely rule out the possibility that MWC 656 
is a B2 III giant with some C depletion (compare to the spectrum of the MK 
standard HD 35468 in Extended Data Fig. 3), but the strength of Sim and Ou 
lines, which are the features most sensitive to temperature in this spectral range, 
strongly supports a B1.5 III classification. 

Armed with the spectral classification, we can now set constraints on the mass of 
the Be star using the several calibrations available in the literature. Based on 
evolutionary tracks’’, giant stars in the range B1-B2 have mass 12-17 Mo. On 
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the other hand, the most precise calibrations are provided by dynamical determi- 
nations in detached eclipsing binaries but unfortunately data on B1-B2 giants are 
very scarce. The closest example is provided by the B1.5 III star in V380 Cyg, where 
a dynamical mass of 13.1 + 0.2 Mo has been reported*’. No further analogies are 
found in a recent exhaustive compilation of detached binaries”*. In any case, robust 
lower limits to the mass of our target are provided by dynamical masses of main- 
sequence stars. For instance, ref. 33 gives 11M for B1 V and 9Mo for B2 V, while 
ref. 34 yields 10M« for B1.5 V. Taking into account all the above, we support a 
mass range of 10-16 Mq for the Be star MWC 656. We also remark that the above 
mass range might be an underestimate because of the large rotational velocity of 
MWC 656. This causes the star to appear cooler and slightly less luminous than an 
object of the same stellar mass rotating at lower speed. This effect could increase 
our estimated mass by ~10-15%, which would raise the black-hole mass even 
more”. 

We can also estimate the distance to MWC656 by comparing the absolute 

magnitudes of B1.5-B2 III stars with the observed brightness, corrected for inter- 
stellar reddening. To estimate the latter, we have constructed the spectral energy 
distribution (SED) of MWC 656 in the ultraviolet and optical bands***”. The SED 
was fitted in the range 150-500 nm (where the contribution of the Be disk is 
marginal) with TLUSTY** and FASTWIND” models reddened by variable amounts, 
and the best result is obtained for colour excess E(B — V) = 0.24. Adopting an 
absolute visual magnitude My in the range —3.7 to —4.5, based on the calibration 
of ref. 31, and V = 8.75 + 0.10 mag (ref. 40), we derive a distance of 2.6 + 0.6 kpc. 
Note that in this calculation we have neglected the contribution to the optical light 
from both the circumstellar Be disk and the black-hole accretion disk. The fact that 
MWC 656 shows photometric variability modulated with the orbital period argues 
for some contribution from non-stellar sources to the optical flux. However, the 
modulation has a semiamplitude of only 0.020 mag, suggesting that these other 
contributions have a very modest effect®” and, therefore, we can safely conclude 
that the Be star dominates the observed optical flux. In any case, the large error in 
our distance estimate encompasses the uncertainty introduced by the small con- 
tribution of non-stellar light sources to the observed optical flux. Incidentally, the 
SED does not show any ultraviolet excess as would be expected if the companion to 
the Be star were a stripped He core. This gives additional support for the presence 
of a black hole in MWC 656. 
X-ray analysis. Inspection of the HEASARC archives at the position of MWC 656 
reveals X-ray observations by ROSAT and Swift. The ROSAT/Position Sensitive 
Proportional Counters (PSPC) observations (0.1-2.4keV) were conducted on 
7-11 July 1993 (orbital phases 0.63 to 0.70) and consisted of five pointings of 
~3-6 ks each. The Swift/X-Ray telescope (XRT) observations (0.3-10 keV) were 
conducted on 8 March 2011 (orbital phase 0.52) and consisted of two 1-ks point- 
ings separated by around 3h. 

We have analysed both data sets using standard procedures within HEASOFT 
V 6.12. Assuming a photon index I’ = 2.0 (typical of black holes in quiescence*’) and 
a hydrogen column density Ny; = 1.4 x 107! cm~? (using the obtained E(B — V) 
and the relation with Ny of ref. 42) the following upper limits (at 90% confidence) 
are obtained: Foi-24 xev< 12X10 % ergcm * s | for ROSAT/PSPC and 
Fo3~10 kev < 4.6 X 10°? ergcm *s' for Swift/XRT. 
y-ray association. MWC 656 has been proposed as the optical counterpart of the 
transient GeV y-ray source AGL J2241+ 4454 (refs 5, 6). The association is, how- 
ever, uncertain because AGL J2241 +4454 was only detected during a 2-day activ- 
ity period and it has a position error circle radius of 0.6° (ref. 9). A few binary 
systems have been detected at GeV and/or TeV energies”, all showing orbitally 
modulated y-ray emission and most of them thought to contain young non-accreting 
pulsars. In contrast, the accreting black hole Cygnus X-1 showed evidence of TeV 
emission for less than one day, and also two different short transient episodes of 
GeV emission*’. The black-hole nature of MWC 656 and its putative flaring y-ray 
activity are reminiscent of those of the well-known black hole Cygnus X-1 but, 
remarkably, the accretion luminosity in MWC 656 is typically lower by more than 
~5 orders of magnitude. 

A candidate black-hole/neutron-star progenitor. The future evolution of 
MWC656 will probably lead to a black-hole/neutron-star binary**. During the 
red giant phase the 13M Be star will expand by several hundred solar radii’, thus 
engulfing the black hole. Mass transfer from the expanding Be star onto the black 
hole will be dynamically unstable and a common envelope will ensue. This is a highly 
dissipative process which leads to spiral-in of the black hole, efficient circularization 
of the orbit and the ejection of the Be star envelope. The outcome of the common 
envelope phase will then be a 2.9M. He star** and the present ~5M black-hole 
companion in a close circular orbit. In the event of a symmetric core collapse, the 
newly born black-hole/neutron-star binary will remain bound because less than 
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half the total initial mass is expelled in the explosion”. In the case of an asymmetric 
supernova, the binary survival will depend on the magnitude and direction of the 
kick. 

Black-hole/neutron-star binaries, which have not yet been detected, are instru- 
mental in providing fundamental tests of gravitational theories, strong sources of 
gravitational waves and prime candidates for the production of short y-ray bursts 
through coalescence’. The fate of MWC 656 as a possible black-hole/neutron-star 
binary is very relevant because it provides tight empirical constraints on detection 
rates for gravitational wave observatories, such as advanced LIGO/VIRGO™. 
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Extended Data Figure 1 | Time evolution of the He 11 4,686 A emission line Maximum velocity occurs at HJD 2455722.2 or photometric phase 0.06. 


in MWC 656. a, Radial velocities obtained from single Gaussian fits to the line —_b, Equivalent width (EW) as a function of time. We used the convention of 
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Extended Data Figure 2 | Diagnostic diagram for the He II 4,686 Aline in _ separation for which the continuum noise starts to dominate. Error bars, 1 s.d. 
MWC 656. It has been computed using the double-Gaussian technique with a Panels display the evolution of the sine wave fitting parameters with Gaussian 


Gaussian width equal to the instrumental resolution full-width at half- separation a. From top to bottom: the systemic velocity y, the sine wave go 
maximum, 55kms_'. The vertical dotted line indicates the Gaussian phase, the velocity semiamplitude K and the control parameter o(K)/K. 
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Extended Data Figure 3 | Classification spectrum of the Be star MWC 656. —_(B1.5 III) and HD 35468 (B2 III) (ref. 54). The standards have been artificially 
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Extended Data Table 1 | Observing log of MWC 656 
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Date Telescope+ Spect. Range # Exp. Integration Res. 
instrument A (sec. ) AX/X 
23 Apr — 28 Jul 2011 LT+FRODOspec 3900-5215 oe 600 5000 
28 May 2012 LT+FRODOspec 3900-5215 1 600 5000 
2—4 June 2012 LT+FRODOspec 3900-5215 3 600 5000 
26 Oct 2012 MT+HERMES 3770-9000 1 900 85000 
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Extended Data Table 2 


Orbital elements derived from radial velocities of the Het! 4,686A and Fell 4,583A lines 


Parameter Heir A4686 ~— Fe tt A4583 
Ps» (days) 60.37 (fixed) 60.37 (fixed) 
To (HJD—2,450,000) 3245.347.5  3243.143.7 
€ 0.08+0.06 0.24+0.08 
w (deg) 351.7+44.4 164.4+22.1 
Y (km eB) —44,5+3.4 —13.5+1.8 
K (km s+) 78.844.6 31.042.4 
asini (Re) 93.7+5.4 35.9+2.8 
f(M) (Mo) 3.02+0.53 0.17+0.04 
o; (km s') 19.2 10.3 


To is the epoch of periastron, e the orbit eccentricity, m the longitude of periastron, ) the systemic velocity, K the velocity semiamplitude, a the semimajor axis, ithe binary inclination, f(M) the mass function and o; the 
rm.s. of the fit. 
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Nanoparticle solutions as adhesives for gels and 


biological tissues 


Séverine Rose!, Alexandre Prevoteau’, Paul Elziére', Dominique Hourdet’, Alba Marcellan’? & Ludwik Leibler? 


Adhesives are made of polymers’ because, unlike other materials, 
polymers ensure good contact between surfaces by covering aspe- 
rities, and retard the fracture of adhesive joints by dissipating 
energy under stress”*. But using polymers to ‘glue’ together poly- 
mer gels is difficult, requiring chemical reactions, heating, pH 
changes, ultraviolet irradiation or an electric field*’”. Here we show 
that strong, rapid adhesion between two hydrogels can be achieved 
at room temperature by spreading a droplet of a nanoparticle solu- 
tion on one gel’s surface and then bringing the other gel into contact 
with it. The method relies on the nanoparticles’ ability to adsorb 
onto polymer gels and to act as connectors between polymer chains, 
and on the ability of polymer chains to reorganize and dissipate 
energy under stress when adsorbed onto nanoparticles. We dem- 
onstrate this approach by pressing together pieces of hydrogels, for 
approximately 30 seconds, that have the same or different chemical 
properties or rigidities, using various solutions of silica nanoparti- 
cles, to achieve a strong bond. Furthermore, we show that carbon 
nanotubes and cellulose nanocrystals that do not bond hydrogels 
together become adhesive when their surface chemistry is modified. 
To illustrate the promise of the method for biological tissues, we 
also glued together two cut pieces of calf’s liver using a solution of 
silica nanoparticles. As a rapid, simple and efficient way to assemble 
gels or tissues, this method is desirable for many emerging tech- 
nological and medical applications such as microfluidics, actuation, 
tissue engineering and surgery. 

Very often, when brought into contact and pressed together, two 
pieces of an elastic gel do not stick and gel-gel friction is very low’*. 
Incorporating supramolecular or covalent reversible bonds in poly- 
mers can produce gels that are self-adhesive” '’. However, manipulat- 
ing self-adhesive gels is not always practical. A solution could be to 
use supramolecular networks that are able to self-heal when cut into 
pieces’”. Dispersing clay particles in polymer solutions or networks 
yields gels that combine high elasticity and toughness with self-healing 
capabilities’*"*. In such gels, the clay particles play the part of reversible 
crosslinks'*”°. Nevertheless, self-healing gels cannot meet all the demands 
of gluing in assembly. At present, gluing gels or biological tissues 
requires complex methods, often involving in situ polymerization®”” 
or electrophoretic transport of polymers to the interface*’. Therefore, it 
is desirable to find new ways of gluing that are practical and adaptable. 
Inspired by the physics of polymer adsorption, we propose to use 
nanoparticle solutions as an adhesive (Fig. 1). 

The design principle is simple: to act as an efficient glue, the particles 
must be adsorbed onto the gel surface. The surface of the particles 
must therefore exhibit an affinity with network chains, that is, the free 
energy gain, ¢, resulting from the adsorption of a monomer to the 
surface of a particle should be comparable with the thermal energy, 
kT (ref. 18). Consider a particle adsorbed onto a surface of a gel having 
swelling degree Q, the swelling degree being defined as the ratio of 
the volume of the swollen gel to the volume of the dry network. The 
number of adsorbed monomers per network strand, n, can be estimated 
using theoretical tools analogous to those developed for adsorption of 


polymer solutions. We expect that there are many monomers adsorbed 
per strand, n > 1, and that for nanoparticles with diameters compar- 
able with the network mesh size, several different network strands are 
adsorbed to the same particle’*. Thus, in the adhesive layer, nanopar- 
ticles act as connectors between gel pieces and gel-chains act as bridges 
between different particles (Fig. 1a). When the adhesive junction is 
strained, adsorbed chains are under tension (for example, the red chain 
in Fig. 1b) and some adsorbed monomers detach from the particle 
surface and relax the tension. Nanoparticles retard failure and ensure 
good adhesion because the energy dissipated during the stress-induced 
desorption process is much greater than «. Indeed, detaching only one 
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Figure 1 | Gluing gels by nanoparticle solutions. a, Schematic illustration 
of the concept of gluing swollen polymer networks together using particles. 
The nanoparticle diameter is comparable with the gel network mesh size. 
Network chains are adsorbed on nanoparticles and anchor particles to gel 
pieces. Particles act as connectors between gel surfaces. Adsorbed chains also 
form bridges between particles. The black arrows indicate the pressure applied 
to squeeze the gel layers together. b, Particle adsorption is irreversible 
because particles are anchored to the gel networks by numerous attachments 
(red, light- and dark-blue strands). At equilibrium or under tension (indicated 
by black arrows) a monomer that detaches from a particle surface (red strand) 
can be replaced by a monomer belonging to the same or a different network 
strand (shown here as a dark-blue strand). Such exchange processes and 
rearrangements allow for large deformations and energy dissipation 

under stress. 


1Physico-chimie des Polyméres et Milieux Dispersés (UMR7615 UPMC/ESPCI/CNRS), Université Pierre et Marie Curie, Ecole Supérieure de Physique et de Chimie industrielles, 75005 Paris, France. 
?Matiére Molle et Chimie (UMR 7167 CNRS/ESPCI), Ecole Supérieure de Physique et de Chimie Industrielles, 75005 Paris, France. 
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monomer releases the tension in the whole segment of N/n monomers 
(N is the number of monomers per network strand or, equivalently, 
the number of monomers between two crosslinks). This mechanism 
resembles one proposed to explain rubber resistance to fracture’, but 
here desorption rather than chain breaking is responsible for stress 
release and energy dissipation. Furthermore, in contrast to chain break- 
ing, which is irreversible, in adsorbed layers there is a constant traffic 
of monomers between the adsorbed and desorbed states’” and a neigh- 
bouring strand (for example, the dark-blue chain in Fig. 1b) may adsorb 
and replace the detached link. These exchange processes enable the 
adhesive junction to sustain large deformation and to dissipate energy”, 
yielding an efficient resistance to interfacial fracture propagation and 
a strong adhesion. 

Gluing by nanoparticles may seem paradoxical given that powder- 
ing surfaces with micrometre-sized particles such as talc provides a 
standard means of preventing self-adhesion. Yet Gent and co-workers 
have shown that when the particle-elastomer interactions are suitably 
controlled, powdering self-adhesive elastomers can increase the self- 
adhesion energy by a factor of two’. We go a step further to show 
that systems mostly composed of solvent and which do not adhere to 
themselves can be bonded by particle solutions. 

To demonstrate the concept and test the importance of gel chain 
adsorption onto particles, we synthesized two hydrogels: S0.1, made 
of poly(dimethylacrylamide) (PDMA) and A0.1, made of polyacryla- 
mide. Both gels have the same crosslinking density and similar swelling 
degree and do not adhere to themselves (Fig. 2a and Extended Data 
Table 1). Polyacrylamide does not adsorb onto silica”* whereas poly 
(dimethylacrylamide) adsorbs readily**. When a 15-1 drop of TM-50 
silica suspension was spread on the PDMA gel surface and another 
PDMA piece was pressed to form a lap junction, a strong adhesion was 
observed after a few seconds of contact (Fig. 2b, c and Supplementary 
Movie 1). In contrast, when using polyacrylamide gels, even when we 
pressed very firmly and for a very long time, we could not create lap 
junctions that held under their own weight, thus confirming that the 
lack of gel chains being adsorbed onto nanoparticles prevents gluing. 

Joints made of SO.1 gel were stronger than the gel itself and failure 
occurred outside the bonding junction. Only when the overlap length 
was made comparable to the ribbon thickness did interfacial failure by 
peeling occur (Fig. 2d and Extended Data Figs 1 and 2). From the mea- 
sured failure force, the adhesion energy’*, G,g,, could be estimated 
to be 6.6 + 1.6Jm ~ for short and thin joints, and 6.2 + 1.4Jm * for 
narrow and thick joints. Strong adhesion was also achieved using silica 


Figure 2 | Lap-shear adhesion tests. a, Lightly crosslinked S0.1 gels stick 

to the table surface and to gloves but they do not stick to themselves. b, By 
spreading a drop of TM-50 silica solution on the gel surface, two gel pieces are 
glued together after being brought into contact for few seconds. c, The glued lap 
joint is able to sustain large deformations. d, The failure force measured by the 
lap-shear adhesion test for lap joints of various overlap length. Red circles 
indicate fracture outside the joint; blue squares indicate interfacial failure by 
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particles of different sizes (Fig. 2e and Extended Data Fig. 3). Adhesion 
strength increased when particle size was increased, although it should 
be noted that changing the size of particles implies variations of the 
surface chemistry as well. 

Indeed, adapting particle surface chemistry provides a powerful tool 
with which to produce adhesion by improving suspension stability and 
by promoting specific interactions such as hydrogen bonding that 
strengthen the particle adsorption to the gel surface. Thus, grafting thy- 
mine to carbon nanotubes produced adhesion. Similarly, CNC1 cellu- 
lose nanocrystals bearing sulphate groups yield an adhesion strength 
comparable with that obtained with nanosilica, whereas CNC2 part- 
icles bearing hydroxyl groups only were useless as a glue (Fig. 2e). 

Gluing strength also depends on the properties of the gel. In tightly 
crosslinked gels, strands are more constrained and there is a higher 
entropy penalty for adsorption’. In addition, the energy dissipation 
mechanisms discussed above are less efficient for short strands (when 
N is small). The adhesion energy G,gy was therefore weaker for more 
rigid gels (Fig. 2f). But, more rigid gels are also more brittle, as shown 
by single-edge notched tensile test results. Hence, in practice, even for 
rigid gels, gluing by nanoparticles is sufficiently strong to allow the 
design of joints that withstand tensile stresses without adhesive failure. 
It is also possible to glue gels with very different rigidities strongly 
together (Extended Data Fig. 4). 

Gluing by nanoparticle solutions is not limited to hydrogels that are 
in the as-synthesized state. Dehydration of the gels before gluing is 
expected to increase adhesive strength, whereas gel swelling should 
have the opposite effect. Indeed, in a more swollen state, the strands are 
more stretched and adsorb less readily. However, by choosing particles 
(AL-30) of the ideal size and gel affinity, we were able to glue together 
completely swollen S0.1 gels containing as much as 97.6 v/v% of water. 
Adhesion energy was as high as 1.6 + 0.6J m 7 (Extended Data Fig. 5). 

The design principle ensures that adhesion remains when the joint is 
immersed in excess solvent and swells. Indeed, particles once adsorbed 
stay strongly anchored to the gel surfaces, the probability of total desorp- 
tion of a gel strand being exponentially small'*”’: exp(—ne/kT). Scan- 
ning electron microscopy confirmed that adsorbed nanoparticles cover 
the surface densely, even after soaking the gel in pure water and wash- 
ing the surface several times (Fig. 3a). When a lap joint made of S0.1 
hydrogels glued with TM-50 nanoparticles was immersed in water it 
withstood a fivefold volume increase without failure (Fig. 3b). The 
adhesion energy as measured by a lap-shear test at maximum swelling 
was 1.8+0.5Jm “, that is, 3.5 times lower than in the as-synthesized 


d L 1 


Failure force (mN) 
a 
Oo 
i 
T 


0 1 0 
Overlap length (mm) 


| a 


9 
om Lah Ne) Re ai De, 
ll =H a 
. oi 
on 5 


peeling. e, Failure force, F, normalized by the width of the joint, w, for lap joints 
glued using solutions of various particles. Red bars indicate fracture outside 
the joint; blue bars indicate failure by peeling (mean; error bars are s.d.). 
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F/w (N m*) 
3 
! 


G (J mr) 


16 JANUARY 2014 | VOL 505 | NATURE | 383 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


10 20 
Displacement (mm) 
Figure 3 | Water-resistant and self-repairing glue. a, Scanning electron 
micrograph showing that the TM-50 silica layer adsorbed on a S0.1 gel surface 
was not removed by multiple washing and soaking of the gel in water for several 
days. b, After reaching the maximum swelling and being immersed in deionized 
water for four weeks, the PDMA S0.1/S0.1 assembly glued together using 
TM-50 silica held well. c, Lap-shear test showing that after adhesive interfacial 
failure by peeling (blue line), the S0.1/S0.1 joint glued using HS-40 silica solution 
was repaired by bringing ribbons back into contact with finger pressure for 
about 30s (red line). d, Gelatine/PDMA S0.1 junction glued with TM-50 silica 
after immersion in water for one week to reach maximum equilibrium swelling. 


state (Extended Data Fig. 5 and Supplementary Video 2). In the fully 
swollen state the adhesion strength is weaker because detaching adsorbed 
chains is easier as a result of the higher swelling tension the adsorbed 
chains are already under. Moreover, once detached, a swollen strand is 
less prone to adsorb and strand-strand exchange dissipation processes 
are hindered. The adhesive joint made by adsorbing nanoparticles (Fig.1) 
ressembles a slice of a nanocomposite gel. Hence, a nanocomposite 
PDMA/silica gel can be considered to be a bulk model of the adhesive 
joint. The highly reduced deformation energy dissipation observed in 
a fully swollen nanocomposite gel seems to confirm the above inter- 
pretation of decrease of adhesion when the gel is swollen (Extended 
Data Fig. 5). 

The strong irreversible anchoring of once-adsorbed particles sug- 
gests the attractive possibility of self-repairing or re-positioning adhes- 
ive joints. Figure 3c shows an example of a joint, which was peeled, but 
that can recover its initial strength when the ribbons are brought back 
into contact and pressed with fingers for a few seconds, without any 
need to re-apply the glue. 

Particle solutions offer a simple method of gluing gels of different 
chemical nature, provided that the particle surface chemistry is prop- 
erly adjusted to allow adsorption on both gels. For example, using the 
TM-50 silica solution, a robust assembly of S0.1 and gelatine was achieved 
(Extended Data Fig. 4). In many applications (such as actuation), gluing 
gels of different rather than identical chemical nature together presents 
advantages, such as the possibility of assembling gels with different 
rigidity, but similar equilibrium swelling. Such assemblies can with- 
stand swelling in excess water (Fig. 3d). In contrast, the swelling of a 
glued assembly of chemically identical gels with mismatched swelling 
capacities can lead to high, heterogeneous osmotic stresses near the 
interface, resulting in slow interfacial failure (Extended Data Fig. 4). 

Soft biological tissues, although they are incomparably more complex, 
both mechanically and osmotically, resemble gels in many respects. To 
test the gluing potential of nanoparticle solutions we cut two ribbons 
45mm X 18mm X 3 mm of calfliver. Cut pieces do not adhere to each 
other and cannot be glued by water at pH 9. We spread 60 il of silica 
TM-50 solution on the cut surface (without any pre-treatment or 
special drying) to make a lap joint with overlap length /= 20mm. 
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Figure 4 | Gluing biological tissues. a, Two pieces cut from calf liver were 
glued together by spreading TM-50 nanosilica solution between them and then 
pressing them with a finger. After 30 s of contact the assembly was manipulable 
and the bond held well. b, Normalized force-displacement curves for lap joints 
made of ribbons cut from calf liver and glued by spreading TM-50 silica 
solution between them and then pressing ribbons with a finger for 30s. The 
ribbons were cut with a scalpel blade and no treatment was applied to liver 
surfaces before gluing. Results for two livers are presented. 


After being pressed for 30s with a finger, the lap joint held strongly 
and could be manipulated with ease (Fig. 4a). Lap-shear adhesion 
tests on two livers yield adhesion energy Gagn ~ 25 + 5J m 7 and 
Gadh ~6£1.6Jm” (Fig. 4b and Supplementary Video 3). 

The results suggest that nanoparticle solutions provide a simple way 
of assembling synthetic and biological hydrogels as well as biological 
tissues without affecting substantially the rigidity or permeability of the 
assembly. Powerful methods exist to tune and control the surface chemi- 
stry of inorganic particles and latexes to achieve optimal particle adsorp- 
tion and bonding. The possibility of self-repairing and re-positioning 
peeled adhesive joints is an additional boon. Given the importance of 
wet adhesion in biomedicine and biotechnology as well as in more 
traditional coating and material technologies, our results suggest ways 
to develop new applications by simply assembling many kinds of 
chemically and mechanically mismatched tissues and gels. 


METHODS SUMMARY 


Silica Ludox TM-50, HS-40 and SM-30 water solutions with, respectively, con- 
centrations of 52 wt%, 40 wt% and 30 wt% at pH 9, pH 9.5 and pH 10, SiO2/Na,0 
ratios of 200-250, 89-101 and 45-56, and radii of about 15nm, 9nm and 5nm, 
were purchased from Aldrich and used as received. Stober silica particles”, AL-30, 
with radius 50 nm were synthesized and dissolved in water at 30 wt% (pH = 8.5). 
Multi-wall carbon nanotubes were supplied by Arkema (Graphistrength C100) and 
purified with sulphuric acid. Thymine-grafted carbon nanotube particles (CNT- 
Thy) were synthesized using the method of ref. 28. Cellulose nanocrystals CNC1 
bearing sulphate and hydroxyl groups were prepared using the method of ref. 29. 
Cellulose nanocrystals CNC2 were prepared using the same method, but replacing 
sulphuric with hydrochloric acid. The suspensions were diluted to the desired 
concentrations (0.5 wt% and 3 wt% for carbon nanotubes and cellulose nanocrys- 
tals, respectively) were sonicated for 30 min just before use. 

PDMA and PDMA/silica nanocomposite gels were prepared using the method 
of ref. 15. The polyacrylamide A0.1 hydrogel was prepared by in situ free radical 
polymerization of acrylamide using thermal dissociation of potassium persulphate 
(KPS) as initiator, at 80 °C. N,N’-methylenebisacrylamide (MBA) was used as the 
crosslinker; the MBA/dimethylacrylamide ratios were 0.1 mol.%, 0.5 mol.%, 
1mol.% and 1.5 mol.%, for samples S0.1, $0.5, $1.0 and S1.5, respectively, and 
MBA/acrylamide was 0.1 mol.% for the A0.1 gel. At the preparation state, the gel 
matrix hydration was fixed at 87.7 wt%. Gelatine (Technical 1, VVR) gels were 
prepared at 23 wt% in aqueous solutions. 

Scanning electron micrographs were obtained using a Field Emission scanning 
electron microscope (Hitachi SU-70). Lap-shear and mechanical tests were performed 
on an Instron 5565 machine. Single lap-shear geometry was used for adhesion 
tests. Gluing was achieved by applying a contact pressure of 10kPa for 30s. 
Fracture energy G, was measured with the single-edge notch tensile test*®. 
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Data are available in the online version of the paper; references unique to these 
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METHODS 

Particles. Silica Ludox TM-50, HS-40 and SM-30 water solutions with, respect- 
ively, concentrations of 52 wt%, 40 wt% and 30 wt% at pH 9, pH 9.5 and pH 10, 
SiO,/Na,0 ratios of 200-250, 89-101 and 45-56, and radii of about 15 nm, 9nm 
and 5 nm, were purchased from Aldrich and used as received. Silica particles AL-30 
with radius about 50 nm were synthesized using the method of ref. 27 and dissolved 
in water at 30 wt% (pH = 8.5). 

Multi-wall carbon nanotubes were supplied by Arkema (Graphistrength C100) 
and purified with sulphuric acid. Thymine-grafted carbon nanotube particles 
(CNT-Thy) were synthesized using the method of ref. 28. 

Microgranular cellulose (cotton) was purchased from Aldrich. Cellulose nano- 
crystals CNC1 bearing sulphate and hydroxyl groups were prepared using the 
method of ref. 29. Cellulose nanocrystals CNC2 were prepared using the same 
method, but replacing sulphuric with hydrochloric acid. The suspensions were 
diluted to the desired concentration (0.5 wt% and 3 wt% for carbon nanotubes and 
cellulose nanocrystals, respectively) were sonicated for 30 min just before use. 
Gels and calf liver. PDMA gels were prepared following the method of ref. 15. 
N,N'-methylene bisacrylamide (MBA) was used as the crosslinker and the MBA/ 
dimethylacrylamide ratio was 0.1 mol.%, 0.5 mol.%, 1.0 mol.% and 1.5 mol.% for 
samples S0.1, $0.5, $1.0 and S1.5, respectively (Extended Data Table 1). Potassium 
persulphate (KPS) and N,N,N’,N’-tetramethylethylenediamine (TEMED) were 
used as redox initiators. At the preparation state, the gel matrix hydration was 
fixed at 87.7 wt%. To avoid network defects that lead to a weak self-adhesion of gels 
it is important to conduct the synthesis under nitrogen conditions. The PDMA/ 
silica nanocomposite gel containing 21 v/v% of TM-50 silica was prepared follow- 
ing the method of ref. 15. 

Polyacrylamide hydrogels were prepared by in situ free radical polymerization 
of acrylamide using thermal dissociation of KPS as initiator, at 80 °C. The MBA/ 
acrylamide ratio was 0.1 mol.% for the A0.1 gel. The A0.1 gel hydration at pre- 
paration was fixed at 87.7 wt%. Two aqueous solutions were prepared: KPS at 
4.5 wt% and MBA at 1.2 wt%. The samples were prepared by first dissolving 
MBA and acrylamide at 25 °C in water. The KPS solution was then added and 
the homogeneous solution was purged with nitrogen for 15 min under magnetic 
stirring. The mixture was finally transferred, under a nitrogen atmosphere, into 
laboratory-made moulds previously sealed and put under a nitrogen atmosphere. 
The sealed moulds were placed in an oven at 80°C for 24h. 

Gelatine (Technical 1, VVR) gels were prepared by dissolving the gelatine 
powder in deionized water under stirring at 50°C for 2h. The gelatine concen- 
tration was 23 wt.%. The mixture was then poured into moulds and left at room 
temperature for 30 min. Moulds were stored at 6 °C for two days before testing and 
allowed to come to room temperature for 1 h before testing. 

Swelling experiments were performed in deionized water and the swollen weight 
of the gel was recorded over four days to ensure that the gel was at equilibrium. Gels 
of intermediate swelling degree (Extended Data Fig. 5a) were obtained by immers- 
ing as-synthesized gels in deionized water for some time (30 min, 1h, 2h and 4h, 
respectively) followed by a re-equilibration of samples over 24h. The amount of dry 
polymer was estimated from the amount of monomer, assuming 100% conversion. 

Fresh calf livers were purchased from M. Bonjean, rue Montagne Ste Genevieve, 
Paris and used as received. Samples for tensile and lap-shear adhesion tests were 
cut with a scalpel to the desired dimensions. No pre-treatment or special drying 
was applied to samples or surfaces before the tensile and adhesion tests. 


Tensile tests. Tensile tests were performed on a tensile Instron machine (model 
5565) equipped with a 10 N load cell and with a video extensometer, which follows 
the local displacements of two spots. The experiments were performed at room 
temperature at strain rate of 0.06 s_'. For PDMA and polyacrylamide, gel ribbons 
with dimensions of 40mm X 5mm X 2mm were used. For gelatine, to avoid 
systematic failure in the vicinity of the clamps, samples were cut using a dog- 
bone-shaped die cutter following the ISO4661-1 standard with the reduced section 
of the samples having dimensions of of 25mm X 4mm X 2mm. For calf livers, 
tensile tests were performed on ribbons with dimensions 45 mm X 18mm X 3 mm. 
The elastic moduli of the two livers shown in Fig. 4b were respectively 15.0 + 1.7 kPa 
and 12 + 1.5kPa. 

Adhesion tests. Lap-shear tests were performed on an Instron 5565 machine 
equipped with a 10 N load cell at a speed of 150 mm min '. We used single lap- 
shear geometry (Extended Data Fig. 1a). Displacement was measured by a video 
extensometer that followed two markers (white dots), which were placed at a 
distance of 5 mm from the edge of the lap joint unless otherwise stated in the text. 
The total length of assembled ribbons was 40 mm. Unless otherwise stated, gluing 
was achieved by spreading a nanoparticle solution droplet of volume proportional 
to the overlap surface with a volume of 0.3 pl per square millimetre. The contact 
pressure was 10 kPa and contact time 30s. 

In Fig. 2c the overlap length and width are 20 mm and 10 mm, respectively, and 
the ribbon thickness is h = 2mm. The data of Fig. 2d were obtained for ribbons 
with w = 5mm andh = 2 mm. The data of Fig. 2e were obtained for lap joints with 
overlap length of /= 5mm made of ribbons with w = 5mm and h = 2mm. The 
data of Extended Data Figs 1b, c, 4a and 5b were obtained for lap joints with 
overlap length of 10mm made of ribbons with w = 5mm and h = 2mm. 

For the lap-shear test, when failure occurred by interfacial peeling, we evaluated 

the adhesion energy from the measured adhesive failure force F using the express- 
ion for short lap joints”®, Gan = 3(F/ w)?/(2Eh), where wand h denote, respectively, 
the width and thickness of the ribbon, and Eis the tensile modulus. For PDMA S0.1 
gels glued using TM-50 silica particles we found Gag to be 6.6 + 1.6Jm * and 
62+14Jm ’, respectively, for short and narrow (/ = 2mm, w = 5mm, h = 2mm) 
and thick (= 5mm, w = 2mm, h = 5 mm) joints. For S0.5 and S1.0 gels (Fig. 2f) 
the lap-joint dimensions were / = 5mm, w= 5mm, h = 2mm. 
Self-repair tests. The data of Fig. 3c were made of PDMA S0.1 ribbons of w = 5 mm 
and h = 2mm glued by spreading a 6 ll droplet of HS-40 silica solution between 
them. The initial length of the assembly before the deformation is about 28 mm. 
The overlap length was 5 mm. After adhesive interfacial failure by peeling, the joint 
was repaired by bringing ribbons back to contact and pressing with fingers for 
about 12s and the lap-shear test was performed again to test the strength of the 
repaired junction. Similar tests were carried out on gels glued together using other 
particles (for example, CNC1) and the ability to self-repair was always observed. 
Fracture tests. The fracture energy data of Fig. 2f were obtained using the single- 
edge notch geometry. A cut was made in the centre of the samples using a blade. 
Each notch was measured by optical microscopy to determine its exact length a, of 
approximately 1 mm. The same procedure as the one used for tensile tests was 
performed: the strain rate was fixed to be 0.06 s-', and the force and the displace- 
ment data were recorded. The fracture energy was calculated using the following 
expression: G, = 6aW/(A,)"”?. Here W is the strain energy density, approximated 
by the surface area under the (engineering) stress—/ curve, where / is the extension 
ratio, and A, denotes the extension ratio at the breaking point”. 
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Extended Data Figure 1 | Lap-shear and tensile tests of PDMA S0.1 gels. 
a, Lap-joint geometry. Displacement was measured by a video extensometer 
that followed two markers (white dots), which were placed at a distance of 

5 mm from the edge of the lap joint. The total length of the assembled ribbons 
was 40 mm. w denotes the width and h the thickness of gel ribbons. / is the 
overlap length. b, Comparison of force-displacement curves for PDMA S0.1 
ribbon (black line) and for the lap joint glued by spreading 15 pl of TM-50 silica 
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solution (blue circles). Displacement was measured by optical extensometer 
from two markers, initially spaced by 20 mm and centred on the joint. The 
PDMA S0.1/S0.1 assemblies broke outside the joint (inset). c, Lap-shear 
adhesion test reproducibility. Force-displacement curves of PDMA S0.1/S0.1 
lap joint illustrating lap-shear test reproducibility. All gel ribbons were cut from 
the same plate. Bulk failure outside the joint was systematically observed. 
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Extended Data Figure 2 | Measurement of adhesion energy of PDMA S0.1 
gels. a, Lap-shear test geometry in which interfacial failure by peeling was 
observed for S0.1 gel ribbons glued by spreading 6 ,1l of TM-50 silica solution 
(/= 5mm, w= 2mm and h=5 mm). b, Force-displacement curves for 
PDMA S0.1/S0.1 lap joints. Adhesive failure by interfacial peeling was 
observed. All gel ribbons were cut from the same gel plate and the tensile 
modulus was measured to be E ~ 8.1 + 1.0 kPa (error is s.d.). From the 
measured failure force, the adhesion energy can be estimated to be 
6.2+14Jm ° (error is s.d.). 
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Extended Data Figure 3 | Lap-shear test of PDMA S0.1 gels glued by various 
particle solutions. In order of increasing deformation at the breaking point are 
shown the adhesive failure by interfacial peeling in CNT-Thy (grey), CNC1 
(green), SM-30 (yellow) and HS-40 (purple). Fracture outside the junction 
occurred for TM-50 (blue) and AL-30 (red). Lap-joint dimensions were 
1=5mm, w=5mmandh=2mm. 61U of solution was spread to make 

the junction. 
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Extended Data Figure 4 | Gluing gels of different stiffness or chemical 
nature. a, Force-displacement curve for an assembly made of soft PDMA S0.1 
and rigid PDMA S1.5 (red triangles) gels glued by TM-50 solution. For 
comparison the results obtained under identical conditions for the symmetric 
PDMA S0.1/SO assembly are plotted (blue circles). Lap-joint dimensions were 
1= 10mm, w= 5mm and h = 2mm. 15 pl of TM-50 solution was spread to 
make the junction. b, Glued at their preparation state, both PDMA S0.1 and 
PDMA S1.5 gels initially had the same size (diameter of about 10 mm). The 
picture shows gels after 5h of swelling in deionized water. The highly 
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crosslinked PDMA S1.5 gel (top piece) is less swollen than the PDMA S0.1 gel 
(bottom piece). Interfacial stresses induced by heterogeneous overswelling 
considerably exceed the shear stresses applied in the mechanical lap-shear test 
ofa and lead to interfacial failure during immersion and overswelling in water. 
De-bonding was slow. c, Lap-shear force-displacement trace for the gelatine 
and S0.1 PDMA gel assembly (green stars) glued by spreading TM-50 silica 
solution. The failure occurred outside the lap joint and the fatal crack 
propagated in tension mode. 
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Extended Data Figure 5 | Gluing overswollen gels and overswelling glued 
gels. a, Adhesion energy of joints made of PDMA S0.1 hydrogels swollen 
before being glued with AL-30 silica solutions to various degrees of swelling Q 
(in blue) and adhesion energy of joints made of S0.1 hydrogels glued with 
TM-50 silica solutions at as-synthesized swelling degree (Qo ~ 8.5) and after 
being immersed in water and swollen to reach the maximum, equilibrium 
swelling degree, Q. ~ 41 (in red) (error bars are s.d.). When S0.1 gels were glued 
with AL-30 particles at the as-synthesized state, bulk failure systematically 
occurred outside the junction even when the joints were very short, narrow and 
thick. To induce peeling (interfacial failure) making cuts at the interface was 
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necessary. b, Lap-shear test for PDMA S0.1/S0.1 assembly glued by TM-50 
silica at the preparation state, Qo (blue circles) and after swelling in water for 
three days and attaining maximum equilibrium swelling Q. (red circles). 

c, Mechanical behaviour of a nanocomposite sample that can be considered as a 
model of the interfacial adhesive layer. Engineering stress is plotted as a 
function of strain. The loading—-unloading cycle at the gel preparation 
conditions (green line) shows a large hysteresis. At equilibrium swelling 

(red dashed line) the hysteresis and dissipation were very weak. Low dissipation 
at swelling equilibrium is responsible for a relatively weaker adhesion after 
immersion in water. 
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Extended Data Table 1 | Hydrogels 


Preparation state Swelling equilibrium conditions 
Water DMA AAm MBA KPS TEMED Silica Gelatine QE (kPa) Q E — Swelling ratio 
Sample —_(g) (9) (9) (mg) (mg) (ul) viv (9) ‘ ° (kPa) Qe! Qo 
S0.1 10.62 1.485 23 41 22.5 85 10+1 4141 4+1 5.1 
NC 10.62 1.485 2.3 41 22.5 0.21 85 93+8 29+4 7+1 3.4 
$0.5 10.69 1.485 11.5 41 22.5 85 26+1 2342 2.7 
$1.0 10.77 1.485 23.0 41 22.5 85 4542 17+2 2.0 
$1.5 10.94 1.485 34.5 41 22.5 85 60+4 15+2 1.7 
AO.1 7.62 1.065 23 41 9.0 11+1 
Gelatine 11.10 3.31 5.3. 30+2 29+2 49 


Errors are s.d. AAm denotes acrylamide monomer, TEMED denotes N,N,N’,N’-tetramethylethylenediamine. 
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Asymmetric synthesis from terminal alkenes by 
cascades of diboration and cross-coupling 


Scott N. Mlynarski’, Christopher H. Schuster! & James P. Morken! 


Terminal, monosubstituted alkenes are ideal prospective starting 
materials for organic synthesis because they are manufactured on 
very large scales and can be functionalized via a broad range of che- 
mical transformations. Alkenes also have the attractive feature of 
being stable in the presence of many acids, bases, oxidants and reduc- 
tants. In spite of these attributes, relatively few catalytic enantiose- 
lective transformations have been developed that transform aliphatic 
a-olefins into chiral products with an enantiomeric excess greater 
then 90 per cent. With the exception of site-controlled isotactic poly- 
merization of g-olefins', none of these catalytic enantioselective pro- 
cesses results in chain-extending carbon-carbon bond formation 
to the terminal carbon’ *. Here we describe a strategy that directly 
addresses this gap in synthetic methodology, and present a single- 
flask, catalytic enantioselective conversion of terminal alkenes into 
a number of chiral products. These reactions are facilitated by a neigh- 
bouring functional group that accelerates palladium-catalysed cross- 
coupling of 1,2-bis(boronates) relative to non-functionalized alkyl 
boronate analogues. In tandem with enantioselective diboration, 
this reactivity feature transforms alkene starting materials into a 
diverse array of chiral products. We note that the tandem diboration/ 
cross-coupling reaction generally provides products in high yield and 
high selectivity (>95:5 enantiomer ratio), uses low loadings (1-2 mol 
per cent) of commercially available catalysts and reagents, offers an 
expansive substrate scope, and can address a broad range of alcohol 
and amine synthesis targets, many of which cannot be easily addressed 
with current technology. 

Development of catalytic enantioselective reactions that operate 
efficiently with low catalyst loadings and high levels of selectivity is a 
paramount challenge in organic chemistry. This challenge is even greater 
when one targets the transformation of «-olefins that have a small steric 
bias between the prochiral faces of the alkene. For this reason, there are 
few catalytic asymmetric processes that operate effectively with ali- 
phatic terminal alkenes. We sought to address this significant gap in 
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Figure 1 | The diboration/cross-coupling (DCC) strategy and potential 
applications. a, The DCC cascade. An efficient cross-coupling reaction that 
applies to alkyl pinacol boronates would enable conversion of terminal alkenes 
to a broad array of useful building blocks. b, Prospective targets. The DCC 
reaction followed by oxidation provides an alternative to carbony] allylation for 


synthesis methodology by developing a catalytic enantioselective reac- 
tion that converts terminal alkenes into chiral reactive intermediates; 
in this manner, one might introduce a number of useful catalytic asym- 
metric reactions simultaneously. A first step in the development of this 
strategy was achieved in engineering a Pt-catalysed enantioselective 
alkene diboration (Fig. 1a)’. In this Letter we present remarkably effi- 
cient cross-coupling reactions that apply to diboration products and 
collectively provide a strategy for enantioselective carbohydroxylation, 
carboamination and bisalkylation of terminal alkenes. These strategies 
enable the construction of many biologically significant molecules and 
should allow practicing chemists to assemble target structures in new 
ways. For example, the homoallylic alcohol embedded within the frame- 
work of the cytotoxic natural product epothilone C (Fig. 1b) might be 
accessed by a diboration/cross-coupling (DCC) reaction followed by 
oxidation. Alternatively, diboration followed by cross-coupling and 
amination could provide a new route to structural variants of the thera- 
peutic agent tamsulosin from propene as a feedstock. Last, hydrocarbon 
stereocentres such as the one appearing in the antitumour macrolide 
kendomycin can be forged by DCC reaction followed by homologation 
of the remaining boronate. 

The Pt-catalysed enantioselective diboration of terminal alkenes with 
B,(pin), (here pin indicates pinacolato) offers a platform for the con- 
struction of new molecular ensembles. In tandem with diboration, 
oxidation transforms terminal alkenes to enantiomerically-enriched 
1,2-diols. A far greater range of new molecular building blocks would 
arise from terminal alkenes if 1,2-bis(pinacol boronates) would directly 
participate in efficient cross-coupling. Although related cross-couplings 
with bis(catechol boronates) are known’, conversion of terminal alkenes 
to enantiomerically enriched 1,2-bis(catechol boronates) is generally 
not enantioselective. Therefore, a strategy for terminal alkene manip- 
ulation based on selective diboration reactions requires successfully 
engaging alkyl pinacol boronates as nucleophilic partners in Suzuki- 
Miyaura cross-coupling’. However, contrary to commonly employed 
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alkyl boranes and boronic acids, alkyl pinacol boronates are generally 
recalcitrant substrates in such processes'®. Indeed, the only reported 
cross-coupling with a bis(pinacol boronate) involved two equivalents 
of a highly activated organic electrophile’’. The contrasting reactivity 
between classes of boron reagents can be traced to a difference in 
transmetallation rates during the catalytic Suzuki cross-coupling reac- 
tion (Fig. 2a). Meticulous mechanistic studies'*’* are in concert with 
prior assertions”* and suggest that one operative mechanism for trans- 
metallation involves pre-association of a Pd(hydroxide) with a neutral 
trivalent boron centre. Accordingly, it can be surmised that the dimin- 
ished Lewis acidity of alkyl pinacol boronates relative to other boron 
derivatives lies at the root of the reactivity difference between these 
reagents. Thus, engaging pinacol alkyl boronates in cross-coupling has 
required the use of toxic bases’° or pre-formed anionic four-coordinate 
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Figure 2 | Observations on the Pd-catalysed cross-coupling of 
1,2-bis(boronates) with bromobenzene. a, Generalized mechanism for the 
palladium-catalysed Suzuki-Miyaura cross-coupling reaction. L,, = ligands. 
b, Cross-coupling of bromobenzene with alkyl pinacol boronates in the 
presence of RuPhos (structure shaded) shows a pronounced rate enhancement 
due to the presence of a vicinal boronate. Note that for entry 4, 5 mol% 
Pd(OAc),/RuPhos was employed for 20h. Cy = cyclohexyl. c, A direct 
competition experiment suggests that the rate enhancement occurs at the first 
irreversible step or any step that precedes it. 
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“ate” complexes'®. Recent advances” in the design of efficient ligands 
for metal-catalysed cross-coupling reactions have begun to provide a 
solution to this problem and indeed one recent report’* suggests that 
pinacol boronates can participate in the Suzuki reaction in the presence 
of RuPhos”’, a monodentate phosphine ligand. 

To determine whether 1,2-bis(pinacol boronates) can engage in effi- 
cient cross-couplings with unactivated organic electrophiles, we exam- 
ined the reaction between isolated and purified bis(boronate) 1 (Fig. 2b) 
and bromobenzene under a wide range of reaction conditions. Optimal 
conditions are depicted in Fig. 2b and reveal that extraordinarily effi- 
cient cross-couplings can be achieved. In the presence of 1 mol% Pd(OAc),, 
1 mol% RuPhos, and employing aqueous KOH as the base for 1h, we 
obtained a 91% yield of alcohol 5 (Fig. 2b) after oxidative work-up. 
Importantly, alcohol 5 was isolated as a single constitutional isomer; 
the product from coupling with the secondary boronate was not detected. 
The reaction of 5 is remarkable in comparison to cross-couplings of 
other pinacol boronates (Fig. 2b). For example, under the same con- 
ditions in which 1 reaches 91% conversion, n-octyl boronate 2 is not 
detectably transformed. Reactions of 1,3-bis(boronate) 3 and 1,4-bis 
(boronate) 4 indicate that the rate acceleration experienced by 1 relies 
not just on the presence of a second boronate unit, but also on its posi- 
tion relative to the first. 

To gain further insight into the special features of the 1,2-bis(pinacol 
boronate), we performed a direct competition experiment where both 
1 and octylB(pin) 2 were subjected to cross-coupling in the same flask 
(Fig. 2c). In this experiment, >95% conversion of the 1,2-bisboronate 
was achieved whereas only a trace amount of product was produced 
from the monoboronate. With the reasonable assumption that trans- 
metallation in the presence of hydroxide is irreversible, the outcome of 
this experiment suggests that the enhanced reactivity of the bis(boro- 
nate) is probably due to an enhanced rate of transmetallation; if trans- 
metallation occurred at similar rates with each substrate but the rate 
retardation with octylB(pin) was due to slow reductive elimination, 
then octylB(pin) should sequester the Pd catalyst and retard the reac- 
tion of both substrates. Thus the presence of the adjacent non-reacting 
boronate appears to have a profound effect on the rate of transmetalla- 
tion, leading to >50-fold enhancement in reactivity of the substrate. 

A number of plausible explanations may account for the reaction 
acceleration by the vicinal boronate in 1, but here we consider two. 
Similarly to the Lewis-base-induced activating effect that an adjacent 
carbonyl has on a reacting boronate”, we considered that cooperative 
binding of hydroxide by the neighbouring boron centres might furnish 
an “ate” complex such as A (Fig. 3); for related observations in cross- 
coupling of geminal boronates, see ref. 21. Alternatively, we considered 
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Figure 3 | Mechanistic considerations for the cross-coupling rate 
enhancement observed with 1,2-bis(boronates). a, Whereas rate 
enhancements in Suzuki-Miyaura couplings might result from internal Lewis 
base donation to the reacting boronate (as in compound A), internal Lewis 
acid activation (as in compound B) may allow the boronate to better bind a 
reactive Pd(OH) species. b, The stereochemical outcome of cross-coupling 
(retention of configuration at carbon) is consistent with an inner-sphere 
transmetallation, suggesting that internal Lewis acid activation is the more 
likely pathway. 
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Figure 4 | Tandem single-pot DCC provides a new route to 
enantiomerically enriched benzylic alcohols from terminal alkenes. Shaded 
area shows the reaction: varying ‘R’ in the reactant gives reaction products 
shown. Under each product are given yield in per cent, and either enantiomeric 
ratio (e.r.) or diastereomer ratio (d.r.). Yield refers to isolated yield of purified 
product and is an average of two experiments (individual experimental 


that the function of the adjacent boron atom might be to act as a Lewis 
acid, coordinating to the pinacolato oxygen (B) thereby enhancing the 
Lewis acidity of the neighbouring boron centre; in line with the discus- 
sion above, this might enhance the reactivity of the primary organo- 
boronate. The stereochemical outcome of the cross-coupling reaction 
might provide a clue to the operative reaction mechanism: with four- 
coordinate boron in A, transmetallation would necessarily occur by an 
outer-sphere path and result in inversion of configuration at the prim- 
ary carbon”. Alternatively, reaction via B would most probably occur 
by an inner-sphere path and occur with retention of configuration’*”. 
To test these proposals, the cross-coupling of isotopically labelled 
substrate 9 and chloroisobutylene was examined and it was found to 
occur with retention of configuration at carbon, providing 10 as the 
product. The outcome of this labelling experiment suggests that an inner- 
sphere transmetallation may operate, perhaps involving association of 
a Pd(hydroxide) complex with B. 

Conveniently, both alkene diboration and catalytic cross-coupling 
can be accomplished in a one-pot protocol, transforming simple ter- 
minal olefins to secondary alcohols after oxidation. Using bromoben- 
zene as a model electrophile, 1-octene was found to successfully engage 
in the tandem sequence and provided homobenzylic alcohol 11 after 
oxidation of the cross-coupled product (Fig. 4). Replacement of bro- 
mobenzene with chlorobenzene also resulted in effective conversion to 
11 (88% yield); however, reactions with either iodobenzene (30% yield) 
or phenyltriflate (48% yield) gave lower yields of the desired product. 
In addition to 1-octene, a number of other olefins were successfully 
engaged in the tandem sequence with bromobenzene as the electro- 
phile, affording the products in high yield and with high enantioselec- 
tivity (products 12-18). Both linear and branched aliphatic substrates 
as well as those containing pendant olefins and silyl ethers were well 
tolerated. Olefins derived from allylic or homoallylic alcohols under- 
went smooth reaction and importantly, when either hydrocarbon-based 
or oxygen-based f stereocentres are present (products 14-17), effective 
catalyst control produces the products in excellent diastereomer ratios 
(17:1 to >20:1 d.r.). Examination of other aromatic electrophiles showed 
that electron-poor and electron-rich arenes as well as those that are steri- 
cally encumbered readily couple (19-22). Importantly, heteroaromatics 
can also be used and give highly enantioenriched adducts (23-25). 

Homoallylic alcohols are strategically important compounds in syn- 
thetic organic chemistry, and significant resources have been directed 
towards their asymmetric construction. Almost exclusively, these motifs 
are accessed by allylation of carbonyl electrophiles with nucleophilic 
reagents”*. An alternative route to these structures from terminal alkenes 
becomes available when vinyl electrophiles are engaged in the tandem 
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cross-coupling for 23 and 24 employed 1 equiv. LiCl. (R,R)-3,5-diethylphenyl- 
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DCC reaction. In preliminary studies, we investigated the coupling of 
vinyl bromides, but these most often occurred with lacklustre reaction 
efficiency (for example, 12% yield of 26 from the vinyl bromide). Whereas 
vinyl iodides were also ineffective (<5% yield), the reactions of vinyl 
chlorides were highly effective and furnished homoallylic alcohols in 
excellent yields on oxidative work-up (Fig. 5a). Because the olefin stere- 
ochemistry is retained during the course of Suzuki cross-coupling reac- 
tions, the DCC reaction provides ready access to homoallylic alcohols 
bearing configurationally defined trisubstituted (27 and 28), cis- and 
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Figure 5 | The DCC tandem sequence provides access to synthetically useful 
chiral homoallylic alcohols that are not readily prepared by carbonyl 
allylation reactions. a, Construction of substituted homoallylic alcohols by 
DCC reaction/oxidation. Shaded area shows reaction: products are shown 
under. Variation of the chloroalkene employed in the cross-coupling gives 
differently substituted alkenes in the product. b, Use of dichloroethane allows 
for in situ formation of vinyl chloride and provides an effective route to 
unsubstituted homoallylic alcohols and amines (shaded). Yield and e.r. are 

as defined in Fig. 4 legend. Cbz = carboxybenzyl. 
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Figure 6 | DCC tandem reactions provide short new synthesis routes to 
important medicinal agents. These routes employ new feedstocks relative to 
existing routes and can facilitate new structure-activity relationship studies. 
a, Preparation of Boc protected (S)-amphetamine (shaded). b, Preparation of 


trans-disubstituted (26 and 29) and cyclic alkenes (30 and 31) ina 
stereoselective fashion; construction of these substituted motifs with 
contemporary carbonyl allylation methods, when possible, requires 
specialized difficult-to-access reagents. Last, we considered that unsub- 
stituted homoallylic alcohols might be accessed from vinyl chloride; 
however, handling this toxic and gaseous electrophile is cumbersome 
and requires specialized equipment. We found that a straightforward 
alternative arises from the use of dichloroethane: under the basic reac- 
tion conditions, this inexpensive liquid reagent is presumably converted 
to vinyl chloride and engages in cross-coupling (Fig. 5b). After oxidative 
work-up, we isolated unsubstituted homoallylic alcohol 32 in outstanding 
yield and enantiomeric excess. Of note, homoallylic amines can also be 
accessed with the DCC strategy by subjecting the purified DCC inter- 
mediate to amination rather than in situ oxygenation. This allowed 
construction of 33 from 1-octene (Fig. 5b) also in outstanding levels of 
enantioselectivity and good yield; for recent efficient catalytic enantio- 
selective imine allylation to give allyl amines see ref. 24. 

The versatility of the catalytic DCC reaction provides a rapid route 
to important product motifs from simple alkene feedstocks. To demon- 
strate the power of this strategy, we targeted a diverse array of biologically 
important molecules. For example, chiral phenethylamine derivatives 
are broadly active pharmaceutical agents most often produced from 
ketone precursors”. Alternatively, as the sequence in Fig. 6a indicates, 
the DCC reaction allows N-tert-butoxycarbonyl-(S)-amphetamine to be 
prepared quickly from the commodity chemical propene as the starting 
material. Efficient diboration is followed by cross-coupling to give sec- 
ondary boronate 34. Subjection of 34 to stereospecific direct amination”® 
and Boc protection furnished the target in excellent enantiomeric purity. 
Importantly, the modular nature of this new route to phenethylamine 
derivatives could dovetail with strategies for high-throughput synthesis 
and provide rapid access to new derivatives for biological studies. Using 
the one-pot DCC in conjunction with boronate homologation”’ and 
oxidation converts (Fig. 6b) propene to primary alcohol 35. Chlori- 
nation followed by Sy2 reaction furnished the potent fungicide (S)- 
fenpropimorph with high levels of enantioenrichment (Fig. 6b)**. Another 
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application involves construction of lignan lactones, a broad class of 
natural products that exhibit a wide range of biological activity. Star- 
ting from commercially available safrole, we constructed key inter- 
mediate 36 in three steps: DCC reaction, boronate homologation and 
oxidative olefin cleavage (Fig. 6c). Simple alkylation of lactone 36 is 
known to furnish a variety of lignan natural products such as isodeox- 
ypodophyllotoxin and isostegane”’. Last, we note that the DCC reac- 
tion can be used to give 37; homologation, amination and protection 
of 37 furnished 38 (Fig. 6d). Subsequent oxidative olefin cleavage and 
deprotection of 38 provides a new route to the pharmaceutical agent 
Lyrica (pregabalin) from a unique set of organic chemical building blocks 
(Fig. 6d)*°. 

In summary, the catalytic enantioselective diboration of terminal 
alkenes, combined with Pd-catalysed cross-coupling, provides a flexible 
platform for the construction of a broad array of chiral compounds 
from non-functionalized terminal alkenes. Although application of 
this methodology in target-oriented synthesis is easy to foresee, when 
one considers the tremendous variety of «-olefins and aryl/vinyl elec- 
trophiles that are available, the DCC reaction sequence should also 
provide new strategies for diversity-based synthesis. In addition to its 
direct impact on molecular synthesis, the studies presented here define 
a unique reactivity characteristic of 1,2-bis(pinacol boronates). We 
anticipate that the enhanced reactivity of the 1,2-bis(boronate) in trans- 
metallation reactions may enable a range of other important enantio- 
selective terminal alkene transformations that will have an impact on 
the field of chemical synthesis. 


METHODS SUMMARY 

The general procedure for the one-pot DCC reaction as described in Fig. 4 is as 
follows. Pt(dba)s (1.0 mol%), (R,R)-L1 (1.2 mol%), Ba(pin)2 (1.05 equiv.) and anhy- 
drous THF ([substrate] = 1.0 M) are stirred together at 80°C for 15 min. After 
cooling to ambient temperature, the alkene (1.0 equiv.) is added and the reaction 
mixture is stirred at 60 °C for 3h. On cooling to ambient temperature, Pd(OAc), 
(1.0 mol%), followed by RuPhos (1.0 mol%), the electrophile (1.5 equiv.), KOH 
(3.0 equiv.), additional THF and deoxygenated water ([substrate] = 0.1 M; 10:1 v:v 
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THE:H,0) are added and the reaction mixture is heated to 70°C for 12h. The 
reaction is then cooled to 0 °C and treated with 3 M aqueous NaOH and 30% H20o. 
After 4 h at ambient temperature, excess HO; is carefully quenched with saturated 
aqueous Na,S.,Os, followed by extraction with ethyl acetate. The combined organics 
are dried over Na2SO,, filtered and concentrated. The resulting material is purified 
by flash chromatography on silica gel. For complete experimental details and char- 
acterization of all new compounds, see Supplementary Information. 
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A signature of transience in bedrock river incision 
rates over timescales of 10*-10’ years 


Noah J. Finnegan', Rina Schumer? & Seth Finnegan? 


Measured rates of river incision into bedrock are commonly inter- 
preted as proxies for rates of rock uplift (see refs 1 and 2, for example) 
and indices of the strength of climatic forcing of erosion over time (see 
refs 3 and 4, for example). This approach implicitly assumes that river 
incision rates are in equilibrium with external forcings over a wide 
range of timescales. Here we directly test this assumption by examining 
the temporal scaling of bedrock river incision from 155 independent 
measurements of river incision compiled from 14 sites. Of these sites, 
11 exhibit a negative power-law dependence of bedrock river incision 
rate on measurement interval, a relationship that is apparent over 
timescales of 10*-10’ years and is independent of tectonic and geo- 
morphic setting. Thus, like rates of sediment accumulation’, rates of 
river incision into bedrock exhibit non-steady-state behaviour even 
over very long measurement intervals. Non-steady-state behaviour 
can be explained by episodic hiatuses in river incision triggered by 
alluvial deposition, if such hiatuses have a heavy-tailed length distri- 
bution®. Regardless of its cause, the dependence of incision rate on 
measurement interval complicates efforts to infer tectonic or climatic 
forcing from changes in rates of river incision over time or from 
comparison of rates computed over different timescales. 

Unglaciated topography is shaped by competition between tectonic 
uplift and bedrock river incision’. The potential for rivers to grow steeper, 
convey more water via orographic precipitation and thus become more 
erosive with higher rates of tectonic uplift suggests that rates of surface 
erosion should evolve to match rates of rock uplift in actively uplifting 
ranges®. At the same time, climate-driven changes in sediment supply 
and water discharge to rivers are thought to modulate rates of vertical 
river incision into rock over geologic time””°. These arguments imply that 
measured rates of bedrock river incision can constrain active tectonic 
processes as well as temporal variability in the strength of climate forcing 
of erosion. 

Measured rates of land-surface change in aggradational (formed by 
sediment deposits) settings, in contrast, exhibit a negative power-law 
dependence of accumulation rate on measurement interval that is not 
directly attributable to tectonic or climatic forcing"’. This ‘Sadler effect’ 
arises when the duration—frequency distribution of hiatuses between 
intervals of accumulation has a heavy-tailed distribution®. Such dis- 
tributions emerge from a variety of stochastic sediment accumulation 
models®. Over measurement intervals smaller than the longest hiatus, 
sedimentary sequences incorporate longer hiatuses at longer time- 
scales and thus average accumulation rates tend to decline with mea- 
surement interval. It is unclear whether information about external 
climatic or tectonic forcing can then be recorded®. For example, an 
analysis of clastic coastal and continental shelf deposits indicates that 
only for measurement intervals exceeding 10*-10° years do rates of 
sediment accumulation cease declining and therefore begin to reflect 
tectonic subsidence rates’. This measurement interval corresponds to 
the longest recorded hiatus. 

A bedrock river is commonly conceptualized as an alluvial river bed 
overlying a bedrock channel bed that is incised only when the stripping 
of deposited alluvial material exposes bedrock to processes of abrasion, 


weathering and plucking’. Hiatuses in incision begin when the alluvial 
bed aggrades and end when its elevation returns to that of the bedrock 
channel, for example following flood scour’? (Fig. 1a, b). If the long time 
series of alluvial bed elevation can be described as a stochastic process®””, 
then the return time to bedrock will follow a power-law frequency dis- 
tribution in the long time limit°. This is true whether aggradation and 
scour events follow a simple random walk process"’, a long-range corre- 
lated random process, or a random walk marked by power-law periods 
between deposition or erosional events'*. Under this model, bedrock 
incision rate will decline with measurement interval according to a nega- 
tive power-law relationship (Fig. 1c). The measurement interval over 
which a system exhibits negative power-law rate scaling is related to 
the longest physically possible hiatus duration in the period of record. 
Only over timescales exceeding the duration of the longest possible hiatus 
can changes in river incision rate be confidently interpreted in terms of 
tectonic or climatic forcing. 

It is widely recognized that mass-wasting triggers rapid and deep 
alluvial bed aggradation in bedrock channels following earthquakes", 
fires'® and large storms’’. Additionally, both rapid scour and deposition 
of alluvium occurs during floods in bedrock channels. Given the stoch- 
astic nature of these processes governing alluvial bed elevation change’’, 
the potential exists for a negative power-law dependence of bedrock river 
incision rates on measurement interval. Although numerical modelling 
suggests that steady-state bedrock incision arises over relatively short 
measurement intervals (10° years) despite stochastic forcing of sediment 
supply”, a review of bedrock incision records in the southeastern USA” 
observed a negative scaling of bedrock incision rates with measurement 
intervals of 10°-10’ years. Another study observed negative rate scaling 
for a variety of erosion processes over similar timescales’. Therefore, the 
fidelity of the process of bedrock river incision as a recorder of external 
climatic and tectonic forcing remains uncertain. 

To test for a dependency of bedrock river incision rate on measure- 
ment interval, we compiled 14 bedrock river incision data sets that 
each span at least one order of magnitude in time (see Methods and 
Supplementary Table 1). To avoid the spurious correlation that arises 
from plotting a rate against its own denominator (measurement interval), 
we computed the power-law relationship between cumulative bedrock 
incision and measurement interval for each of the 14 data sets (see 
Methods). The temporal scaling of cumulative bedrock incision can then 
be related to the temporal scaling of bedrock incision rate by subtracting 
one from the cumulative incision versus measurement interval power- 
law exponent, herein referred to as f (Fig. 1c)°. Of the 14 data sets, 11 
exhibit values of f that are less than one, implying a negative power-law 
dependence of incision rate on measurement interval (Fig. 2; Extended 
Data Table 1). For the entire data set, the mean power-law exponent 
relating cumulative river incision and measurement interval is about 
0.8 (implying a rate versus measurement interval exponent of about 
—0.2). In addition, we find that the apparent negative power-law depend- 
ence of incision rate on measurement interval persists over four orders of 
magnitude in time (10*-10’ years) (Fig. 3). Because tectonically inactive 
rivers tend to preserve longer incision records than tectonically active 
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Figure 1 | The connection between stochastic alluvial bed elevation change, 
cumulative bedrock incision and incision rate scaling. a, Random-walk 
simulation of alluvial bed elevation (black line) and cumulative bedrock 
incision (dashed green line), assuming incision rate is constant when the bed is 
unburied. b, Bedrock canyon evolution implied by simulation. c, Plot of the 
logarithm of cumulative incision (dashed green line) and the logarithm of time- 
averaged incision rate (solid blue line) versus the logarithm of the measurement 
interval calculated from the synthetic incision record in a. Adjacent to each 
curve is the form of the power-law relationship of that variable (cumulative 
bedrock incision or time-averaged bedrock incision rate) to measurement 
interval in terms of the power-law exponent relating cumulative incision to 
measurement interval, f. 


rivers, it is impossible to completely de-convolve measurement interval 
and tectonic setting. Nevertheless, we see no evidence to suggest that the 
observed scaling is strongly influenced by tectonic setting (Fig. 3; Extended 
Data Table 1). Lastly, we note that a negative power-law dependence of 
incision rate on measurement interval is apparent regardless of the par- 
ticular landform used to constrain river incision (Extended Data Table 1). 
Because preservation of paired strath terraces is promoted by channel 
narrowing, which is a common response to accelerating incision”, it is 
conceivable that the terrace record could be biased towards settings with 
accelerating incision and hence negative power-law rate scaling. However, 
a negative power-law dependence of incision rate on measurement inter- 
val is also recorded in unpaired terraces, caves and incised volcanic depos- 
its, which should not be subject to the same preservation bias because they 
do not require valley narrowing for preservation. Consequently, it is un- 
likely that our findings simply reflect a preservation bias. 

Rates of bedrock incision recorded during floods” and over short mea- 
surement intervals” are usually too large to be sustained over geologic ti- 
mescales, implying that long hiatuses must separate intervals of incision. 
Evidence that such hiatuses have a power-law distribution in time (and 
are therefore probably stochastic in origin) is supported by the negative 
power-law dependence of incision rate on measurement interval that we 
observe over 10*-10’ years. Physical evidence for extremely long incision 
hiatuses in bedrock exists in the Himalayas, where valley fills can persist 
for over 10° years (ref. 24). Therefore it is possible that stochastic forcing 
from processes of sediment transport and delivery may be recorded in 
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Figure 2 | Cumulative bedrock incision as a function of measurement 
interval. Log-log plot of cumulative incision versus measurement interval for 
the 14 data sets. Best-fit lines reflect the mean slopes and intercepts derived 
from the error analysis. Dashed lines indicate slopes (that is, /) greater than or 
equal to one. Solid lines indicate slopes less than one. 


rates of bedrock incision even over measurement intervals exceeding 
10° years. 

It has also been argued that gravel aggradation modulated by Quat- 
ernary climate changes provides an external mechanism for generating 
incisional hiatuses in bedrock channels*"®. In this case, return time dis- 
tribution theory would predict a levelling out of incision rates at the 
period of the forcing and thus a transition to an exponent of 1 on a plot 
of cumulative incision versus measurement interval. For example, in 
numerical experiments in which incision hiatuses are driven by gla- 
cial—interglacial climate cycles, rates of vertical incision show little change 
once averaged over the 10°-year interval that corresponds to the dom- 
inant period of late Pleistocene glaciations’”. In contrast, we find that the 
negative power-law dependence of incision rate on measurement inter- 
val persists over four orders of magnitude in time (10°-107 years) (Fig. 3), 
and—importantly—over intervals much longer than the period of any 
known periodic climate forcing. This suggests that, globally, hiatuses in 
river incision into bedrock may not be coupled in a simple linear way to 
periodic Pleistocene climate forcing, as has been frequently suggested. 

If incisional hiatus length is instead imposed by something intrinsic 
to the process that generates hiatuses (for example, maximum landslide 
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Figure 3 | Cumulative bedrock incision versus measurement interval 
exponent fas a function of incision record length. Data points are the mean 
exponent calculated from the error analysis. Error bars represent +1o derived 
from the error analysis. Data points plotted above the steady-state incision line 
reflect rates of incision that increase with measurement interval. Data points 
plotted below the steady-state incision line reflect rates of incision that decrease 
with measurement interval. 
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size), there is no evidence to support the existence of such a limit in the 
data examined here. Long-term average rates of bedrock incision are 
thus apparently inseparable from the interval over which they are mea- 
sured. Before incision rates at different locations can be meaningfully 
compared, the rate-measurement interval scaling of each location must 
first be quantified and then the incision rates need to be normalized to a 
particular measurement interval. The data compiled here also suggest 
that comparisons between river incision rates and, for example, ther- 
mochronologic ages or geodetic rates of surface uplift may be compli- 
cated by the fact that incision rates depend on measurement interval, as 
well as tectonics and climate, as also argued by ref. 20. However, to the 
extent that stochastic variation in sediment input to channels is region- 
ally coherent”’, the incision histories of nearby catchments might be 
quite similar, even if they do not record tectonic or climatic forcing ina 
straightforward way. 

Our inference that the dependence of incision rate on measurement 
interval reflects the influence of episodic alluvial aggradation is sup- 
ported by the observation that bedrock river channels are usually 
covered in debris, even in tectonically active settings'*”°’’. The incision 
scaling observed on the Blue Nile river’’, if it arises from power-law 
hiatuses, implies an instantaneous incision of about 0.7 m in 1 year, but 
only 116m after 10° years. Thus, only during 0.2% of a 10°-year-long 
record (that is, 166 years) would any channel incision occur. In other 
words, as observers, we are far more likely to encounter a channel in a 
state of non-incision. This suggests that an understanding of the pro- 
cesses that generate incisional hiatuses is arguably more important to 
understanding rates of landscape change than is an understanding of the 
incision process itself, as has also been argued by ref. 12. Our analysis also 
suggests that apparent recent increases in river incision rates should be an 
expected consequence of measuring bedrock river incision rates over 
measurement intervals that do not incorporate long hiatuses, as argued 
by ref. 10. 

Do the same issues that complicate the interpretation of river incision 
rates also affect landscape-scale erosion rates? A study of sediment yields 
in steep, mountainous catchments on decadal measurement intervals 
suggests that landscape-scale erosion rates (as opposed to fluvial incision 
rates) are biased towards slow rates over short measurement intervals 
because of infrequently sampled catastrophic erosion events”. This scal- 
ing is the opposite of what we observe in the case of river incision, where 
shorter measurement intervals tend to yield higher rate estimates. One 
potential explanation for this apparent contradiction is that materials 
eroded from hill slopes are subsequently deposited in channels, where 
they may cause hiatuses in bedrock channel incision. Infrequent, large 
hill-slope erosion events could therefore trigger infrequent but long- 
duration hiatuses in bedrock channel incision. That said, in general land- 
scape-scale erosion rates show little apparent dependence on measure- 
ment interval”, an observation that has been attributed to the averaging 
out of local stochasticity with increasing spatial scale of measurement”. 
Thus, the influence of locally stochastic erosional processes on temporal 
rate scaling may depend both on the specific process in question and on 
the spatial scale examined. 


METHODS SUMMARY 


We compiled 14 bedrock river incision data sets that span at least one order of 
magnitude in time (Supplementary Table 1), encompassing a total of 155 measure- 
ments of river incision. We define tectonically active settings as those regions with 
documented tectonically driven rock uplift. Each of the tectonically active data sets 
was also taken from a paper (see the Extended Methods for citations) that measured 
river incision to constrain active tectonic processes. Tectonically inactive settings did 
not meet these criteria. We use reported or estimated uncertainties for each mea- 
surement of cumulative bedrock incision and measurement interval to define the 
uncertainty for each data point in each data set. We performed a Monte Carlo error 
analysis for each data set in which we calculated 3,000 linear fits between the 
logarithm of cumulative bedrock incision and the logarithm of measurement inter- 
val. Measurement interval for a cumulative river incision measurement is equivalent 
to the age of the landform used to constrain incision. This is because cumulative river 
incision for all data points is computed between the modern channel elevation and 
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the palaeo-channel elevation. We do not calculate incision between dated landforms 
because such estimates do not represent statistically independent measurements of 
incision. For each model iteration, we assigned incision and age errors by selecting 
randomly from a normal error distribution for each data point with a standard 
deviation corresponding to the reported or estimated uncertainty in landform age 
and elevation. Slope and intercept distributions were created for each data set from 
the results of the Monte Carlo simulation to define power-law exponents relating 
cumulative incision and measurement interval, as well as corresponding uncertain- 
ties. Because our explanatory variable (measurement interval) has significant uncer- 
tainty, we use a total-least-squares regression method to quantify slope. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


To test for a dependence of river incision rate on measurement interval, we compiled 
14 bedrock river incision data sets, each of which spans at least one order of mag- 
nitude in time (Supplementary Table 1), encompassing a total of 155 independent 
measurements of river incision. First we describe the incision data sources. Then, we 
describe the statistical analyses performed on the data to constrain the relationship 
between cumulative bedrock incision and measurement interval for the 14 data sets. 
We provide the location and citation for each of the 14 incision data sets used in 
the analysis. In addition, we describe how both landform ages and uncertainties 
and cumulative bedrock incision values and uncertainties were determined for 
each data set. Of the 14 data sets, we identified tectonically active settings as those 
regions with documented tectonically driven rock uplift. In addition, each of the 
tectonically active data sets was taken from a paper that measured river incision to 
constrain active tectonic processes. Areas identified as tectonically inactive did not 
meet these criteria. 
Middle Gorge of the Indus river, Pakistan. We used the weighted average ages and 
corresponding uncertainties and elevations reported in table 2 of ref. 3. Elevation 
uncertainty was assigned according to section 3.1 of ref. 3. We ignored point 5 in 
table 2 of ref. 3, following the interpretation of the authors. 
Clearwater river, Olympic Mountains, Washington (state), USA. We used the 
ages and age ranges from figure 4 of ref. 30 to define the ages and age uncertainties 
for terraces. We digitized the terrace elevations from figure 4 of ref. 30 and used the 
reported elevation uncertainties in table 1 of ref. 30. 
Umpqua and Siletz rivers, Oregon Coast Range, Oregon, USA. We used the ages 
and elevations along with their corresponding uncertainties as reported in table 1 
of ref. 31. We selected only the Siletz and Umpqua River data sets as these span the 
longest period of time. 
Bidente and Musone rivers, Northern Apennines, Italy. We used the ages and 
elevations along with their corresponding uncertainties as reported in tables 2 and 
4 of ref. 32. 
Yellow river, Linxia basin, eastern Tibetan plateau margin, China. We used the 
ages and bedrock elevations reported in table 1 of ref. 33. We assume a 10% un- 
certainty for age and a 5m uncertainty in elevation, in keeping with the largest 
observed uncertainties in other data sets. 
Sierra Nevada caves, California, USA. We used the ages and elevations along 
with their corresponding uncertainties as reported in table 2 of ref. 34. 
Trachyte creek, Colorado plateau, Arizona, USA. We used strath ages and 
corresponding uncertainties reported in table 3 of ref. 35. Strath elevations were 
obtained from table 4 of ref. 35. We assigned an uncertainty in elevation of 5 m. 
Grand Canyon, Colorado river, Colorado plateau, Arizona, USA. To avoid com- 
paring reaches with different rates of tectonic forcing, we selected only data iden- 
tified by the authors of ref. 36 as part of the “Western [Fault] Block” of the Grand 
Canyon because it is the longest record presented. Ages and uncertainties as well as 
strath elevations were obtained from table 2 of ref. 36. Strath elevation uncertainties 
were estimated to be half of the maximum pool depth reported in table 2 of ref. 36. 
Yellow river, Jinshaan canyon, Oordos plateau, China. Strath ages and uncertain- 
ties were obtained from table 1 of ref. 37. Terrace elevations were selected from the 
Qilangwo locale in table 2 of ref. 37. Uncertainties in elevation were assumed to 
be 5m. 
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Grand Canyon of the Nile, Blue Nile river, Ethiopia’*. Ages and elevations were 
digitized from figure 5 of ref. 28. Uncertainties of 10% were assumed for age, and 
5m uncertainties were assigned to elevations. 
Mammoth cave, Green river, Kentucky, USA. Ages and uncertainties were 
obtained from table 2 of ref. 38. Elevations were obtained from table 3 of ref. 38. 
Uncertainties in elevation were assigned according to the Methods of ref. 38. 
Cumberland river, Cumberland plateau, Kentucky and Tennessee, USA. Ages and 
uncertainties as well as elevations were obtained from table 1 of ref. 39. Uncertainties 
in elevation were assigned to be 2 m, because of the similarity of this study to ref. 38. 
We performed a Monte Carlo error analysis for each data set, in which we calcu- 
lated 3,000 linear fits between the logarithm of cumulative bedrock incision and the 
logarithm of measurement interval. The measurement interval for a river incision 
measurement is equivalent to the age of the landform used to constrain incision. This 
is because cumulative river incision for all data points is computed between the 
modern channel and the palaeo-channel. We do not calculate incision between dated 
landforms because such estimates do not represent statistically independent measure- 
ments of incision. For each model iteration, we assigned incision and age errors by 
selecting randomly from a normal error distribution for each data point with a 
standard deviation corresponding to the reported or estimated uncertainty in land- 
form age and elevation. Slope and intercept distributions were created for each data 
set from the results of the Monte Carlo simulation to define power-law exponents 
relating cumulative incision and measurement interval, as well as corresponding 
uncertainties. Because our explanatory variable (measurement interval) has signifi- 
cant uncertainty, we use a total-least-squares regression method to quantify slope. 
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Extended Data Table 1 


Location 


Reference 


Cumulative 
incision versus 
measurement 
interval 


exponent (f) 


Power-law fits, tectonic setting and landform type for each data set 


Length of 
incision 
record (kyr) 


Tectonic 
setting 


Landform 


Middle gorge of the Indus river, 
Pakistan 


0.67 


65 


Active 


Unpaired strath 
terraces 


Clearwater river, Olympic 
Mountains, Washington (state), USA 


Paired and 
unpaired strath 
terraces 


Umpqua river, Oregon Coast Range, 
Oregon, USA 


Active 


Unpaired strath 
terraces 


Siletz river, Oregon Coast Range, 
Oregon, USA 


Active 


Unpaired strath 
terraces 


Bidente river, Northern Apennines, 
Italy 


Active 


Paired and 
unpaired strath 
terraces 


Musone river, Northern Apennines, 
Italy 


Active 


Paired and 
unpaired strath 
terraces 


Yellow river, Linxia Basin, eastern 
Tibetan plateau margin, China 


Active 


Strath terrace 


Sierra Nevada caves, California, USA 


Inactive 


Cave 


Trachyte creek, Colorado plateau, 
Arizona, USA 


Inactive 


Paired and 
unpaired strath 
terraces 


Grand Canyon, Colorado river, 
Colorado plateau, Arizona, USA 


Inactive 


Miscellaneous 


Yellow river, Jinshaan canyon, 
Oordos plateau, China 


Inactive 


Strath terrace 


Gorge of the Nile, Blue Nile river, 
Ethiopia 


Inactive 


Dated volcanic 
rocks 


Mammoth cave, Green river, 
Kentucky, USA 


Inactive 


Cave 


Cumberland river, Cumberland 
plateau, Kentucky and Tennessee, 
USA 
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River systems connect the terrestrial biosphere, the atmosphere 
and the ocean in the global carbon cycle’. A recent estimate sug- 
gests that up to 3 petagrams of carbon per year could be emitted as 
carbon dioxide (CO,) from global inland waters, offsetting the car- 
bon uptake by terrestrial ecosystems’. It is generally assumed that 
inland waters emit carbon that has been previously fixed upstream 
by land plant photosynthesis, then transferred to soils, and subse- 
quently transported downstream in run-off. But at the scale of entire 
drainage basins, the lateral carbon fluxes carried by small rivers 
upstream do not account for all of the CO, emitted from inundated 
areas downstream**. Three-quarters of the world’s flooded land con- 
sists of temporary wetlands’, but the contribution of these product- 
ive ecosystems* to the inland water carbon budget has been largely 
overlooked. Here we show that wetlands pump large amounts of 
atmospheric CO, into river waters in the floodplains of the central 
Amazon. Flooded forests and floating vegetation export large amounts 
of carbon to river waters and the dissolved CO) can be transported 
dozens to hundreds of kilometres downstream before being emit- 
ted. We estimate that Amazonian wetlands export half of their gross 
primary production to river waters as dissolved CO, and organic 
carbon, compared with only a few per cent of gross primary pro- 
duction exported in upland (not flooded) ecosystems’. Moreover, 
we suggest that wetland carbon export is potentially large enough to 
account for at least the 0.21 petagrams of carbon emitted per year as 
CO, from the central Amazon River and its floodplains®. Global 
carbon budgets should explicitly address temporary or vegetated 
flooded areas, because these ecosystems combine high aerial prim- 
ary production with large, fast carbon export, potentially support- 
ing a substantial fraction of CO, evasion from inland waters. 

In the global carbon cycle, rivers act not only as vectors from land to 
ocean but also as significant sources of CO, to the atmosphere. The 
amount of carbon that leaves the terrestrial biosphere through inland 
waters is much larger than the amount that ultimately reaches the ocean’. 
There is a growing consensus that outgassed carbon from inland waters 
originates from land, where it was fixed by terrestrial plants, then 
recycled within soils and finally exported to surface waters'’. In water, 
the respiratory destruction of terrestrial organic carbon predominates 
over photosynthetic production”®. However, other sources such as soil 
and groundwater COs, which are particularly significant in small streams, 
or wetland carbon, are necessary to balance CO; evasion at the global 
scale**. Over the past few years, estimates of the outgassing flux have 
substantially increased, in parallel with the development of remote 
sensing tools able to capture flooding’. Of the maximum 15% of global 
land area occupied by water, 9% is seasonally flooded®. Temporary and 


vegetated waters are grouped under the generic term of wetlands. Wetlands 
are among the most productive ecosystems on Earth® and behave as net 
carbon sinks owing to their efficient storage of carbon in waterlogged 
soils’. Yet they have been considered atmospheric CO. sources in 
inland water inventories”*. Therefore, a better understanding of the 
carbon flows through the boundaries of wetlands with uplands, the 
atmosphere and rivers is crucial to determine an accurate continental 
carbon budget. 

In this study, we examined the central Amazon River-floodplain 
system, which comprises the open waters and wetlands located in the 
1.77-million-km’ reference quadrant of the central Amazon basin that 
have been characterized in detail with Synthetic Aperture Radar (SAR) 
imagery” (Fig. 1b). Approximately 14% of this quadrant is occupied by 
wetlands; that is, temporary or vegetated waters in the floodplain, which 
are 96% inundated during high-water periods and 26% inundated during 
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Figure 1 | Study area. a, The Amazon Basin. b, The Central Amazon reference 
quadrant (1.77 million km”). c, The section of the Amazon River and the five 
floodplains investigated in this work. d, The Cabaliana and Janauaca 
floodplains on the Solimoes river. e, The Canagari and Miratuba floodplains on 
the Amazon River. f, The Curuai floodplain on the Amazon River. 
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low-water periods. Forest occupies about 70% of the entire flooded area 
during high-water periods’. These waters emit 210 + 60 teragrams of 
carbon (Tg of C) per year as CO;, of which approximately half is from 
the open waters and the other half is from the wetlands*. We combined 
high-resolution field measurements of the partial pressure of carbon 
dioxide (p,o,) with remote sensing data on the extent of water and 
vegetation in the floodplain and mainstream. We focused on a section 
approximately 800 km long located in the farthest-downstream area in 
the reference quadrant (Fig. 1c). This section is of particular interest 
because the four characteristic biotopes of the central Amazon River 
(flooded forest, deep channels, shallow open floodplain lakes and float- 
ing macrophytes) are all well represented, although the proportions are 
different from those in the reference quadrant (Extended Data Table 1). 


In addition, this section shows a well-defined biogeographic gradient 
from flooded forests that are dominant upstream to open lakes that are 
dominant downstream’*’’. During eight periods from 2007-2011, we 
measured the water peo, continuously in the main channels of the 
Amazon River and its major tributaries and in five floodplain lakes 
(Fig. 1d-f). Our instrumental set-up was best suited to assess extreme 
spatial heterogeneity, including most of the remote and shallow water 
bodies (Extended Data Fig. 1). Except for some clear waters, such as the 
Tapajos River, the p,o, in the main river channels was consistent with 
previous reports® and varied within one order of magnitude, from 
approximately 1,000 to 10,000 parts per million by volume (p.p.m.v.) 
(Extended Data Table 2). In floodplain lakes, the poo, varied by three 
orders of magnitude, from approximately 20 to 20,000 p.p.m.v. The 
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Figure 2 | CO, advection from flooded forest to open waters. a, The process 
of CO, advection could be tested using high-resolution p.., measurements 
performed in the Cabaliana Lake region, where waters have a single 
unidirectional eastward flow (from flooded forest (light grey) to open waters 
(dark grey and black)). b, Theoretical p.o, distributions in floodplains (top 
graph) and river channels (bottom graph) calculated by the one-dimensional 
advection model (Supplementary Methods). The p,o, curves start at a high po, 
of 12,000 p.p.m.v. within wetlands and account for variable transport and 
degassing intensities. The models are labelled according to the parameters 
chosen: for example, model k2w40H8 indicates that the gas transfer velocity k is 
normalized to a Schmidt number of 600 and set to 2cmh_ !, the water velocity 
wis set to 40cms ', and the water depth H is set to 8m. Various settings 
for these three parameters corresponding to typical conditions in rivers and 
floodplains have been applied (Supplementary Information and Extended Data 
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Fig. 5). c, The track of the ship in Cabaliana Lake and the Solimées river channel 
in May 2008. Brown arrows indicate the dominant water flows into the lake. 
d, Results of the measurements obtained while underway during the cruise. 
The conductivity distribution shows that waters from the Solimées river 
channel were dominant in the western part of Cabaliana Lake but were mixed 
with waters from the Manacapuru River in the eastern part of the lake. 

The water pco, showed a clear decrease from west to east as the distance from 
the flooded forest increased. The solid black line shows the theoretical pco, 
distribution modelled by considering a normalized gas transfer velocity of 
12cmh 1, a water velocity of 10cm s ! anda water depth of 5 m, as was 
observed during the cruise. The differences in poo, at the centre of the lake 

in the morning and the afternoon were attributed to photosynthesis by 
phytoplankton, which were abundant in the lake. 
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extreme heterogeneity in the floodplains (Extended Data Figs 2 and 3) 
was related to the connections between the waters and vegetation at all 
spatial and temporal scales. At the centre of the open lakes, the water 
Pco. was highest during high-water periods in the floodplains sur- 
rounded by large flooded forests, whereas the lowest values occurred 
in almost totally isolated lakes during low-water periods and were 
associated with a high phytoplankton biomass (Extended Data Table 2). 
There was a net downstream decline in p.o, along the biogeographic 
gradient; the upstream forested floodplains and channels of the Solimdes 
River always showed higher saturation values than the downstream 
open lakes and channels of the Amazon River. Finally, within indi- 
vidual lakes and during any season, p-o,increased consistently from 
open waters to the vicinity of floating macrophytes and flooded forests. 

Wetland vegetation can support the outgassing of CO) in the sur- 
rounding water through two mechanisms, which differ in the form of 
carbon transported by the waters and in the site where respiration 
occurs (Extended Data Fig. 4). First, litterfall and root exudation release 
labile organic carbon to the water'*'*, where the organic carbon is 
further decomposed; in this organic carbon pathway, heterotrophic 
metabolism and outgassing occur concomitantly in the open waters. 
Second, submerged roots and microbial respiration in wetland soils 
release CO, to the water’’; this CO, is then transported to open areas, 
where outgassing occurs. We investigated the potential magnitude of 
this latter CO, pathway with a one-dimensional model that computes 
the distance that wetland CO, is transported before it is emitted (Extended 
Data Fig. 5). We found that the water movement is fast enough relative 
to gas exchange to maintain high supersaturation over dozens to hun- 
dreds of kilometres without requiring heterotrophic metabolism. The 
ability of water masses to export wetland CO, depends on their velo- 
city, depth, vertical mixing and the gas transfer coefficient k. In a river 
channel that is 30m deep with a typical k value® of 15cmh ' anda 
water current of 150cms ', only 18% of the CO, has been evaded 
150 km downstream of the point source (the rest is still dissolved in the 
river water and is transported further downstream). In contrast, in an 
open floodplain lake that is 2 m deep with a typical thermally enhanced 
k value!” of 12cmh ! and a water current of 20cms 1, 90% of the 
wetland CO, is outgassed 20 km downstream. Our model satisfactorily 
mimics the spatial p.o, gradient observed in May 2008 along a 20-km 
transect in Cabaliana Lake, where the water current flows permanently 
eastward after passing over a large flooded forest area (Fig. 2d). 

The CO, outgassing fluxes in individual lakes increase with the 
percentage of the floodplain covered by vegetation (Fig. 3). Lakes that 
were almost isolated from wetlands had nearly neutral daily CO, 
fluxes, whereas lakes connected to large wetlands showed the highest 
outgassing rates. This relationship was valid in three different seasons 
for a given lake, which is consistent with the flood pulse concept’, and 
also during the same season at different lakes because the vegetation 
cover decreases downstream. Floodplain vegetation that performs aerial 
photosynthesis thus drives CO, outgassing. The CO, pathway appar- 
ently dominates in floodplains (Fig. 2), but it also occurs in river 
channels, as indicated by the strong anomalies in the stoichiometry 
between dissolved CO, and oxygen concentrations. Indeed, respiration 
of inundated emergent vegetation can release dissolved CO, to waters 
through their roots with no concomitant drawdown of dissolved oxy- 
gen, which is supplied from the atmosphere through the leaves’*””. Net 
heterotrophy in lakes and main channels frequently accounts for less 
than 20% of the CO, outgassing flux'’”*, the majority of which is 
supported by wetlands CO, from upstream areas (Supplementary Infor- 
mation). Furthermore, the composition of particulate organic matter 
reveals a transfer of algal- and macrophyte-derived organic carbon 
from the floodplains to the river channel’'”’, and the isotopic com- 
position of respired CO, in channels shows seasonally variable con- 
tributions from C3 plants, C4 plants and algal material’®”’, all of which 
are well represented in the floodplains. Flooded forest is also a potential 
major source of biodegradable macromolecules in the Amazon River™. 
Finally, both the organic carbon and CO) pathways are consistent with 
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Figure 3 | Vegetation-mediated control of CO, outgassing in floodplain 
lakes. Symbols correspond to seasons, and colours correspond to sites. CO2 
degassing was computed from the measured p,o, and an average gas transfer 
velocity and was averaged over the surface area of the lake. Vertical bars indicate 
two standard deviations induced by the spatial and diurnal variation within 
each lake at a given season. The surface areas of the open lakes and the 
surrounding flooded and floating vegetation were obtained from SAR images 
taken during three seasons (Supplementary Methods). The thick dotted line 
represents the linear regression line (7° = 0.90, P< 0.0001); the thin dotted 
lines delineate the 95% confidence bands of the regression. 


the very young '“C age of the outgassed CO, compared with the older 
terrestrial organic carbon transported by the Amazon River”. 
Published metabolic rates'*"'°*°?° and the extent of vegetation” in 
the floodplains of the central Amazon reference quadrant allow some 
quantities of carbon supplied by wetlands to the waters to be com- 
puted. Flooded forests and floating macrophytes provide, through 
litterfall and submerged root respiration, a total of —305 + 120 Tg of 
C per year of atmospheric carbon to the waters (Extended Data Table 3). 
This probably conservative estimate is not significantly different from 
the CO outgassing flux of +210 +60 Tg of C per year®. Central 
Amazonian waters thus receive at least as much carbon from semi- 
aquatic plants as they emit to the atmosphere, and the CO, net eco- 
system exchange of the river system as a whole, including wetlands and 
open waters, should be nearly neutral. This is consistent with airborne 
observations over the Amazon, which do not reveal a large unidir- 
ectional CO; outgassing flux from wetland and river surfaces compared 
to uplands”. Water inside and downstream of a wetland functions 
similarly to the way soil functions in a forest, in that it rapidly returns 
a large fraction of the carbon fixed by photosynthesis to the atmosphere. 
Unlike in the headwaters’, in the central Amazon, the contribution of 
strictly upland terrestrial carbon to CO, outgassing is potentially minor 
compared to the wetland carbon contribution. Unlike terrestrial land- 
scapes that export less than 2% of their gross primary production to 
inland waters'”, Amazonian wetlands export half of their gross primary 
production to waters (Extended Data Table 3). Although more quant- 
itative information is needed on the global distribution of wetland 
primary productivity and carbon export, the available information® 
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suggests that wetlands support a large percentage of inland water CO, 
evasion at the global scale. 


METHODS SUMMARY 


Continuous measurements (once per minute) were performed on eight cruises in 
the channels and floodplain lakes of the Amazon on a ship equipped for this pur- 
pose. River water was continuously pumped from about 50 cm below the surface 
and delivered to the different equipment and sensors. Water pco, was measured 
using a marble-type equilibrator” (see online Methods) connected to an infrared 
gas analyser, which was calibrated before and after each cruise. The response time 
was estimated to be 3 min. The water temperature, conductivity and turbidity were 
measured in an overflowing bucket using a multiprobe, which was calibrated every 
10 days. Fluorescence was measured using a BBE Moldaenke Fluoroprobe. The 
position was recorded using a global positioning system (GPS). During low-water 
periods, shallow and remote lakes were surveyed using a small boat anda 12-V version 
of the measurement set-up. The total number of geo-referenced measurements 
was 45,786. The water and flooded vegetation surface areas were derived from 
SAR data acquired using the PALSAR space-borne sensor (see Supplementary 
Methods). Within the five floodplains sampled and for the three campaigns at high, 
intermediate and low water, the flooded areas were divided among open water, 
flooded forest and floating vegetation. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Study site and remote sensing. The central Amazon river-floodplain system is 
defined as the waters and wetlands located in the 1.77-million-km” reference 
quadrant of the central Amazon basin that has been characterized in detail using 
SAR imagery”? (referred to as ‘quadrant b’; Fig. 1b). We focused our fieldwork and 
remote sensing analysis on a secondary 0.288-million-km? quadrant inside the 
eastern limit of quadrant b, which was referred to as ‘quadrant c’ (Fig. 1c). 
Quadrant c is located in the downstream section of the Amazon River in Brazil, 
from the city of Manacapuru on the Solimées River upstream of its confluence 
with the Negro River to the Amazon River in front of the city of Santarem at the 
confluence of the Amazon with the Tapajos River (Fig. 1c). In this region, waters 
flowing from the Andes are rich in suspended matter and nutrients, have the 
highest conductivity and are termed ‘white”°. Waters draining the primarily low- 
land podzol soils are rich in dissolved organic matter and intensely coloured by 
humic material, have low conductivity and are called ‘black’. Finally, relatively 
transparent waters draining the Precambrian Shields are poor in organic and 
inorganic material, have low-to-intermediate conductivity and are called ‘clear’. 
The main tributaries of the Amazon River sampled were the SolimGes and Madeira 
(white), the Manacapuru and Negro (black), and the Urubu and Tapajos (clear). 
The Amazon River flood is characterized by relatively low interannual variability 
but strong water level differences between the low-water (October-November) 
and high-water stages (May-June). The annual water level within the main stream 
reaches approximately 8 m near Santarem station and up to 12 m at Manacapuru 
station on the Solimées River. The floodplains are connected to the river's main 
stem by a complex system of channels (called Paranas) that vary as a function of 
the flood level’. Spilling over the riverbanks usually occurs during the peak 
inundation phase*'’. These environments participate significantly in the transfer 
of water and matter from the Andes to the Atlantic Ocean; approximately 5% of 
the annual water discharge of the Amazon is estimated to travel through the 
floodplain’. 

Five floodplain systems were selected for fieldwork and remote sensing (Fig. 1 
and Extended Data Table 1c). Farthest upstream, the Cabaliana and Janauaca flood- 
plains receive large inputs of waters from the SolimGes River and are surrounded by 
large areas of flooded forest (Fig. 1c, d). In the Cabaliana, the conductivity is similar 
to that of the Solimées (70-100 pS cm™ ), but the turbidity is much lower because 
of sediment retention in the flooded forest that acts as a filter. In the Janauaca, the 
conductivity is slightly lower (33-70 pS cm” *) because of clear water contributed 
by local drainage. Downstream of the Amazon-Madeira confluence, the Canagari 
and Miratuba mainly receive waters from the flooding of the Amazon and Madeira 
rivers, respectively (Fig. le). Miratuba Lake is well surrounded by flooded forest 
and is almost completely dry at low water (Fig. le). The Canagari is a very homo- 
geneous lake with approximately the same depth all over, and is disconnected from 
flooded forest most of the year. The Canacari receives white waters from the Amazon 
and occasionally clear waters from the Urubu River that drains the northern local 
basin. Finally, in the farthest-downstream part of our study area, the Varzea of 
Curuai is composed of more than 30 interconnected lakes temporarily or perma- 
nently connected to the Amazon main stem by small channels* (Fig. 1f). 

We used L-band SAR radar remote sensing images to monitor the extent of 
flooding at different stages in the five floodplains from 2007 to 2010. Several 
studies using a time series of L-band SAR satellite images reported excellent 
classification results in the quadrant under study'***’*. The mapping accuracy 
was shown to increase with the number of images included in the time series 
(up to eight images**), which allowed the flood dynamics to be retrieved with an 
accuracy higher than 80%. The satellite images were acquired by the PALSAR 
space-borne sensor in wide-beam mode with 250-km-wide images and a resolu- 
tion of 100 m. The PALSAR resolution allowed us to quantify the area of large lakes 
and flooded forest, but not small channels, in the floodplain. Because the area 
studied in this work extends over more than 800km from east to west, five 
PALSAR wide-beam frames and eight images per frame were considered. The 
water-level range corresponding to the image acquisition agrees well with the 
maximum water level at the Obidos water gauge on the Amazon River, from 
1.29 m to 8.56 m, and at the Manacapuru water gauge on the Solimoes River, from 
8.41 m to 20.43 m. 

Flood maps were derived using an algorithm that allows the retrieval of the flood 
dynamics and the primary landscape units** with an accuracy better than 80% 
considering all landscape units and better than 90% for the forested areas. All images 
were co-registered and ground-corrected using the Shuttle Radar Topography 
Mission digital elevation model (http://srtm.csi.cgiar.org/) and georeferenced with 
MapReady software, available from the Alaska Satellite Facility. Speckle reduction 
was achieved by both temporal and spatial filtering using a specific algorithm for 
SAR image time series” and the Lee-Sigma filter**. Filtered multitemporal images 
were segmented into homogeneous regions using the multi-resolution segmentation 
algorithm of the eCognition software (http://www.ecognition.com/). The scale and 
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shape parameter values that control the merging procedure were tested and 
applied uniformly to preserve the main floodplain landscape units. Then, a par- 
allelepiped classifier was used to attribute a status regarding flooding to each 
segment: never flooded (NF), occasionally flooded (OF), and permanently flooded 
(PF). The classifier also provides the general type of land cover: open water, bare 
soil, savannah, forest. Combining all the images and scanning the time series from 
the lowest water level to the highest water level allowed the flooded surface areas to 
be retrieved at the minimum, maximum and intermediate flood levels for each 
landscape unit. The area flooded was assessed for each floodplain system to provide 
the variation in the areas of the flooded forests, the floating vegetation that emerges 
at the water surface, and the open waters of the floodplain lakes. 

In quadrant c (Fig. 1), 31,435 km” are seasonally flooded, of which 37.7% is 

occupied by floodable forests, 10.1% by floating macrophytes, 17.6% is occupied 
by permanent river channels, 13.1% is permanent open lakes in the floodplain, and 
21.1% is temporary open waters in the floodplain. Compared to the central 
Amazon reference quadrant (quadrant b in Fig. 1), open waters in the floodplain 
comprise more of our study area (Extended Data Table 1a, b). This sampling area 
thus provides a large array of floodplain conditions, from predominantly flooded 
forest floodplains, as in most of quadrant b, to open lakes that are more discon- 
nected from wetland vegetation. There is a clear gradient in floodplain vegetation 
along this river section, beginning from the dominant flooded-forest coverage 
upstream on the Solimées River, which represents up to 75% of the total flooded 
area, and ending with the open-lake coverage dominant downstream on the Amazon 
River’, in which only 10% of the floodplain is occupied by forest. The Curuai flood- 
plain (2,333 km?) and the Cabaliana floodplain (1,822 km’) are the largest wet- 
lands, whereas the Janauaca floodplain extends over only 216 km’ at high water. 
The five floodplains studied show a downstream-decreasing trend of flooded- 
forest coverage (Extended Data Table Ic). 
Field work. The CARBAMA database is the largest data set of water pco, and 
ancillary parameters (temperature, conductivity, dissolved oxygen, fluorescence 
and turbidity) recorded in a large tropical river-floodplain system. The total 
number of po, measurements obtained was 47,786 (Extended Data Table 2), 
which, with a frequency of one measurement per minute, represents 32 days of 
continuous measurements while underway in the Amazon River, tributaries and 
five floodplain lakes. The eight cruises were distributed along the Amazon hydro- 
graph to sample low water (November 2007, October 2008 and October 2009), 
rising water (January 2007 and January 2011), high water (May 2008 and June 
2009), and falling water (August-September 2010). 

The cruises consisted of continuous measurements while underway (Extended 
Data Fig. 1), as well as discrete water sampling for chlorophyll a (Chl a) analysis. 
The continuous measurement system consisted of a circuit of river water installed 
ina typical Amazonian ship, with a bypass at the engine cooling water inlet located 
at a depth of about 40 cm. This water flowed gravimetrically into a five-litre bucket 
located in the vessel’s hold that contained a submerged 220-V pump that raised 
water 5 m at a rate of 8 litres per minute. The water was allowed to overflow the 
bucket, and the inflow was optimized to exclude gas bubbles and reduce water 
residence time in the bucket, even when the ship was sailing at maximum speed, 
around 20kmh !. The pump pushed water from the ship’s hold to a one-litre 
bucket, which was also continuously overflowing, located on the side of the boat. 
The water volume in this second bucket at deck level was minimized because it 
contained an Yellow Spring Instruments (YSI) multisensory probe that recorded 
ancillary parameters and a 12-V submerged pump that delivered water at a con- 
stant flow to the equilibrator (about one litre per minute) and to a closed circuit 
passing through a BBE-Moldaenke fluorimeter (2-3 litres per minute). There was 
a height difference of approximately 2 m between the first bucket in the hold and 
the container with the sensors; the equilibrator was placed approximately 1 m 
above the deck floor. By adding salt to the first bucket and measuring the con- 
ductivity in the second bucket, we determined the maximum water residence time 
to be 2.5 min. Comparison with direct measurements in the river revealed that the 
water temperature increased by no more than 0.5 °C in the circuit. There was no 
notable difference in the oxygen concentrations measured with the same optode 
inside and outside the water circuit. A Garmin global positioning system recorded 
the position every minute. The instrumental set-up was also converted to a 12-V 
version that could be used on small boats able to enter large, remote shallow lakes 
inaccessible to vessels at low water levels (Extended Data Fig. 1). In this case, the 
12-V pump and the YSI probes were submerged directly from the side of the boat. 

Ina marble-type equilibrator, water flows from the top to the bottom (0.7-1 litre 
per minute), while a constant volume of air (about 0.5 litres per minute) flows from 
the bottom to the top; surface gas exchange is optimized around the marbles, and 
the volumes of water and air are minimized by the presence of the marbles. peo, 
in the air circuit tends rapidly to equilibrate to the water p.o, value’. The pump 
delivers air from the top of the equilibrator to a LI820 LICOR gas analyser, and 
then back to the bottom of the equilibrator. The air from the equilibrator passes 
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first through a water trap, then through a desiccation tube (10 cm long, 2.cm in 
diameter) containing a desiccant (Drierite), then through the air pump, the gas 
analyser, a gas flow meter, and finally, back to the bottom of the equilibrator??. 
All tubes are made of Tygon or polytetrafluoroethylene (both gas-tight plastics) 
and the connections are also gas-tight. If the equilibrator is bypassed, the air circuit 
loses less than 1 p.p.m.v. of CO, per minute at about 3,000 p.p.m.v. The drying 
tube was intentionally small to limit the effect of the interaction of Drierite with 
CO) (adsorption/desorption), which increases the response time of the system”. 
We changed the Drierite in the tube every day and flushed the system with 
atmosphere for half an hour before measurements. The Drierite was regularly 
regenerated on board using a hairdryer. In the laboratory, before and after each 
cruise, the LICOR gas analyser was calibrated in the 0-20,000 p.p.m.v. range by 
nitrogen passing through soda lime to calibrate the zero and by a gas standard 
certified at 5,000 + 30 p.p.m.v. (Air Liquide) to calibrate the span. The linearity 
was checked using another standard at 500 + 8 p.p.m.v. During cruises, as advised 
by the manufacturer, we regularly checked the zero of the instrument by passing air 
over soda lime, and we reset the zero when the deviation was more than +10 p.p.mv. 
After the cruise, the maximum deviation with the 5,000 p.p.m.v. standard was 
+ 90 p.p.m.v., that is, less than 2%. 

The YSI multiprobe measured the water temperature, conductivity, dissolved 
oxygen, pH and turbidity. We applied the calibration procedures recommended 
by the manufacturer. Calibration was performed at the beginning of each cruise 
and at the middle of cruises that lasted more than 10 days. The BBE-Moldaenke 
Fluoroprobe is a submersible spectral fluorometer that allows continuous record- 
ing of total in vivo fluorescence and discriminates among the main spectral groups 
of phytoplankton (that is, diatoms and dinoflagellates, blue-green algae, crypto- 
phytes and green algae) on the basis of the relative fluorescence intensity of Chl a at 
680 nm (because of the Photosystem II core pigments) after sequential excitation 
of the antenna and accessory pigments by five light-emitting diodes at 450 nm, 
525 nm, 570nm, 590nm and 610 nm (refs 41, 42). The Chl a concentration was 
measured in two to three discrete water samples collected in each floodplain and 
river. The samples were filtered through Whatman 47-mm GF’F glass-fibre filters, 
stored frozen (in liquid nitrogen) until the return to the laboratory, and then 
extracted in 90% acetone overnight. Fluorescence measurements were performed 
using a Turner Designs 10-AU fluorometer before and after acidification with 
hydrochloric acid”. 

Respiration in river and lake surface waters was measured with 24-h incuba- 
tions. Fifteen calibrated 60-ml Biochemical Oxygen Demand (BOD) bottles were 
filled with a well-homogenized and re-oxygenated (about 80% saturated) water 
sample. Five bottles were used for time zero, and ten were incubated in the dark at 
the temperature of the river water. Five bottles were fixed at 12 h, and the last five 
bottles were fixed at 24h. In floodplain lakes, where phytoplanktonic production 
may occur, oxygen production in the light was measured by applying the same 
procedure, except that the samples were incubated on-deck at incident light, in a 
bucket thermostated with lake water. The oxygen concentrations were measured 
onboard the vessel with Winkler titration. Respiration and production were cal- 
culated from the linear regression of the oxygen concentration as a function of 
time; the correlation coefficient was greater than 0.90. The respiration rates were 
multiplied by the water depth to estimate the depth-integrated respiration, with an 
assumption of vertical homogeneity”. In the case of turbid river channels, where 
water depth is high and light penetration is very low, we assume that phytoplank- 
tonic production does not occur and that the net heterotrophic metabolism equals 
the depth-integrated respiration’. With floodplain lakes, we made this assump- 
tion only for the high-water period, when phytoplankton biomass was low. 

Flux computation. The air-water flux of CO, was calculated from the water 
temperature and p,o, according to: 


F(CO>) = koApco, 


where k is the gas transfer velocity of CO, (in units of cmh™! or ms_'), « is the 
solubility coefficient of CO2 (in mol kg™ Tatm '), and APco, is the air-water gra- 
dient of peo, (in atm). The atmospheric p-o, measured every day at the beginning 
and end of the equilibrator records was 402 + 24 p.p.m.v. and did not significantly 
change with the seasons, between morning and evening, or between lakes and chan- 
nel sites; this average atmospheric value was used for all flux calculations, and Ap¢o, 
was largely driven by fluctuation in the water p.o,. We used the polynomial for- 
mulation that gives the solubility coefficient « as a function of water temperature“. 
The gas transfer velocity k was calculated from the gas transfer velocity normalized 
to a Schmidt number of 600 that corresponds to CO at 20°C (ref. 45): 


k = keoo (600/Sc)” 


where kgoo is the normalized gas transfer velocity, Sc is the Schmidt number of a 
given gas at a given temperature”, and n equals 2/3 in floodplain lakes and 0.5 in 
the more turbulent river channels”. 

Despite numerous studies on the subject, important uncertainty still remains 
regarding the parameterization of k¢go in lakes, particularly in Amazon floodplain 
lakes. The widely used parameterizations based on wind speed*’* do not account 
for other drivers (or inhibitors) of turbulence at the water-air interface, such as the 
friction of the current on the bottom”, evaporative heat gain and loss and the 
associated buoyancy fluxes*’*’, or the release of organic molecules that might 
behave as surfactants” by wetland vegetation”. Recently, several authors*”*° have 
hypothesized that the kgo9 value of 2.7 cmh7! used in previous estimates of CO, 
outgassing from the Amazon floodplains is underestimated by a factor of four to 
five. Experimental data of keo9 based on eddy covariance’’ confirm this under- 
estimation exclusively in open lakes under high light incidence and heat flux; on 
the contrary, the k¢o9 values may be much lower within a flooded forest, where no 
keoo data are available, where waters are protected from heat and wind by the 
canopy, where organic surfactants may be particularly abundant™ and which 
represent 70% of the floodable area in the Amazon". Because of the paucity of 
reliable information on k¢oo in floodplains, we provide average CO} fluxes from 
floodplains based on the value of 2.7 cm h7 +, and this value has the advantage of 
allowing quantitative comparison with the CO, fluxes reported previously*. When 
a more precise k¢oo value seems necessary, for instance, in the quantitative com- 
parison between respiration rates and outgassing fluxes, or for the calculation of 
the distance of wetland CO, export, we apply the Cole and Caraco relationship* to 
the wind speeds at 10 m measured on the roof of the vessel at stations during the 
cruise. For the river channels, we use the value of 8.2 cmh‘! used by Richey et al.® 
and based on direct measurements of O, and *”*Rn accumulation in free-floating 
chambers”**”. In all cases, the chosen keoo values are specified in the text. It is 
worthwhile to note that none of the main conclusions of our paper is affected by 
the choice of the k¢oo value. 

The revised carbon budget was computed at the scale of quadrant b, multiplying 
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Upwash exploitation and downwash avoidance by 
flap phasing in ibis formation flight 


Steven J. Portugal’, Tatjana Y. Hubel', Johannes Fritz, Stefanie Heese”, Daniela Trobe”, Bernhard Voelkl*3+, Stephen Hailes)*, 
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Many species travel in highly organized groups'*. The most quoted 
function of these configurations is to reduce energy expenditure and 
enhance locomotor performance of individuals in the assemblage*™’. 
The distinctive V formation of bird flocks has long intrigued researchers 
and continues to attract both scientific and popular attention*””™. 
The well-held belief is that such aggregations give an energetic bene- 
fit for those birds that are flying behind and to one side of another 
bird through using the regions of upwash generated by the wings of 
the preceding bird*”’", although a definitive account of the aero- 
dynamic implications of these formations has remained elusive. 
Here we show that individuals of northern bald ibises (Geronticus 
eremita) flying in a V flock position themselves in aerodynamically 
optimum positions, in that they agree with theoretical aerodynamic 
predictions. Furthermore, we demonstrate that birds show wingtip 
path coherence when flying in V positions, flapping spatially in phase 
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Figure 1 | V formation flight in migrating ibises. a, Northern bald ibises 
(G. eremita) flying in V formation during a human-led migratory flight 
(photograph M. Unsdld). b, Three-dimensional location histogram of the 

7 min flight section, showing position of individual ibises (n = 14) in the V 
formation, with respect to flock centroid, measured by a 5 Hz GPS data logger. 
The colour scale refers to the duration (in seconds) a bird was present in each 
0.25 m X 0.25 m grid. A plot detailing the formation shape for the duration of 
the entire flight can be found in Supplementary Fig. 7. c, Histogram of number 


and thus enabling upwash capture to be maximized throughout the 
entire flap cycle. In contrast, when birds fly immediately behind 
another bird—in a streamwise position—there is no wingtip path 
coherence; the wing-beats are in spatial anti-phase. This could poten- 
tially reduce the adverse effects of downwash for the following bird. 
These aerodynamic accomplishments were previously not thought 
possible for birds because of the complex flight dynamics and sensory 
feedback that would be required to perform such a feat'”"*. We con- 
clude that the intricate mechanisms involved in V formation flight 
indicate awareness of the spatial wake structures of nearby flock- 
mates, and remarkable ability either to sense or predict it. We suggest 
that birds in V formation have phasing strategies to cope with the 
dynamic wakes produced by flapping wings. 

Theories of fixed-wing aerodynamics have predicted the exact span- 
wise positioning that birds should adopt in a V formation flock to 
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of flaps (colour coded) recorded in each 0.25 m X 0.25 m region between all 
birds and all other birds. Most flaps occurred at an angle of approximately 45° 
to the bird ahead (or behind). Transects denoted by dashed lines, directly 
behind or along the most populated V favoured position (just inboard of 
wingtip to wingtip), are the same as those detailed in Fig. 3. d, Histogram 
detailing the total number of flaps recorded between each bird-bird pair, with 
respect to position of the following bird. The shaded area (ii-i) denotes the 
limits of optimal relative positioning, based on fixed-wing aerodynamics. 
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Figure 2 | Histograms demonstrating the positional infidelity for each 
northern bald ibis in the V formation during the migratory flight. The grey 
shaded V shape behind each individual histogram (n = 14) denotes the 
structure for all individuals in the flock (see Fig. 1b). The colour code refers to 


maximize upwash capture*”"'*. The primary empirical evidence con- 
firming that this mechanism is used is a reduction in heart rate and 
wing-beat frequency in pelicans flying in a V formation’. There is a 
general lack of experimental data from free-flying birds, mainly 
because of the complications of measuring the intricate and three- 
dimensional complexity of formation flight, and the lack of appropri- 
ate devices to monitor and record such information. Therefore, the 
precise aerodynamic interactions that birds use to exploit upwash 
capture have not been identified. To investigate the purported aero- 
dynamic interactions of V formation flight, we studied a free-flying 
flock of northern bald ibises (Geronticus eremita) (Fig. 1a), a critically 
endangered migratory species. We used new technology’*’® to mea- 
sure the position, speed and heading of all birds in a V formation. We 
recorded position and every wing flap of 14 birds during 43 min of 
migratory flight using back-mounted integrated global positioning 
system (GPS) (5 Hz) and inertial measurement units (300 Hz) (see 
Methods)'>"®. The precision of these measurements allows the relative 
positioning of individuals in a V to be tracked, and the potential 
aerodynamic interactions to be investigated at a level and complexity 
not previously feasible. 

During a 7 min section of the flight, where most of the flock flew in 
approximate V formation in steady, level and planar direct flight (see 
Methods), we found wing flaps occurred at an angle of, on average, 45° 
to the bird ahead (or behind), and approximately 1.2 m behind (Fig. 1b-d). 
The most populated 1 m X 1 m region was 0.49-1.49 m behind and to the 
side of the bird ahead. The centre of the most populated (0.25 m) spanwise 
region was at 0.904 m, resulting in a wingtip overlap’ ’ of 0.115 m (Fig. 1c, 
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the duration (in seconds) a bird was present in each 0.25m X 0.25 m grid. 
Although individual birds showed some bias towards the front, back, left or 
right regions of the V formation, these positions were not maintained rigidly. 


d; wingspan b = 1.2 m). This falls within the bounds of predictions of fixed- 
wing theory?’ for maximizing the benefits from upwash, which range 
from zero wingtip overlap (assuming no wake contraction*) to, maximally, 
0.13 m (assuming elliptical loading over the pair of wings, and full wake 
contraction from wingspan b to 1b/4)°. 

During this 7 min section of V formation flight, individual birds 
show a certain degree of positional infidelity in the V flock (Fig. 2; 
see also Supplementary Fig. 1 and Supplementary Video 1). Although 
individuals contribute to the statistical V formation, their positioning 
is inconsistent. Certain individuals showed general preferences for a 
particular area in the V formation, but the variability in positioning in 
the flock resulted in no clear leader (see Supplementary Information 
for further discussion). 

Although we observe that, when flying in a V, ibises position themselves 
in locations predicted mathematically from fixed-wing aerodynamics*?™’, 
the wake of flapping birds (in this study, ibises spent 97% of their time 
flapping; see Methods) is likely to be complex’. Wingtip path coher- 
ence, where a flying object flaps its wings in spatial phase with that of 
the individual it is following, has been proposed as a method that would 
maximize upwash capture in V formation flight of birds and flying ro- 
botic devices’. Whether birds are able to take advantage of this extra 
level of complexity present in flapping flight (compared with that of 
fixed-wing flight) has remained unanswered so far. 

In the ibis flock, individual flaps for each bird were described from 
the dorsal acceleration signal from the inertial measurement unit’. 
The temporal phase ¢,emporal is defined here as the proportion of a flap 
cycle of a leading bird at which a following bird initiates a flap. Spatial 
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Figure 3 | Geometric and aerodynamic implications of observed spatial 
phase relationships for ibises flying in a V formation. Temporal phase 
increases as a function of position behind more advanced birds (median + 95% 
confidence intervals of phase for each mean bird-bird interaction in a region). 
When positioned close to a wavelength in line with the V favoured position (a- 
c), wingtip paths approximately match: observed temporal phases agree with 
those predicted from the significant spatial phase relationship (thick black lines, 
+95% confidence intervals) at the most populated 1m X 1 m region, using the 
mean wavelength measured for each position. When positioned directly in line 
(d-f), following birds flap in spatial antiphase, maximally separating wingtip 
paths. In this case the model line is derived from the median spatial phase for all 
bird-bird interactions up to 4m directly behind. Induced flow velocities (blue 


phase P spatial makes use of the temporal phases, and takes account of 
the number of wavelengths, 2, between the bird ahead and the bird 
behind: 


Pspatial = temporal —2nd. 


A spatial phase of zero would indicate that, were the birds to be directly 
following each other, the wingtip paths would match. 

In the most populated 1m 1m favoured V position (Fig. Ic), 
Rayleigh’s test’” for circular statistics indicates a significant unimodal bias 
in both temporal (Rayleigh, P = 0.018, mean phase = 0.857; Hodges- 
Ajne’s test, P = 0.012) and, more strongly, spatial (Rayleigh, P = 0.003, 
mean phase = —1.155; Hodges—Ajne, P = 0.004) phases (Fig. 3a, b) (see 
Supplementary Table 1 for further statistics; Supplementary Figs 2a, 3a and 
4a). Flapping in spatial phase indicates that the wing ofa following bird goes 
up and down tracking the path through the air previously described by the 
bird ahead. The following bird then benefits from consistently flapping into 
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arrows, ¢, f), caused by the trailing wingtip vortices of the bird ahead (vortex 
cores denoted by grey circles), are modelled as infinitely long, parallel vortex 
filaments. Birds flying in typical V formation keep their wings close to the 
region of maximal induced upwash (c) throughout the flap cycle. Birds flying 
directly behind flap in spatial antiphase, potentially reducing the adverse effects 
of downwash (f), both in terms of magnitude and direction. For scale, the 
downwash directly between the vortices would be (—)0.3 m s |, between 
trailing vortices behind a bird of mass 1.3kg, span 1.2 m at a speed of I5ms_ 
(no account is taken of flapping, viscosity or wake contraction). Alternative 
representations of a and d as Cartesian plots can be found in Supplementary 
Fig. 3, and Supplementary Fig. 4 details the extended data array shown beyond 
the presented model line. 


a 


the upwash region from the preceding bird (Fig. 3b, c), presumably redu- 
cing the power requirements for weight support’*"*. 

In contrast, birds flying directly behind, tracking the bird ahead in a 
streamwise position (sampled region 0.5m across, 4m streamwise, 
Fig. 1c), flap in close to spatial antiphase (median = 2.897, where 
precise antiphase would be +3.142), significantly (P < 0.05) deviating 
from flapping ‘in’ spatial phase (see Supplementary Table 1 for further 
statistics; Supplementary Figs 2b, 3b and 4b). As such, the wingtip 
paths of the following bird do not match those of the preceding bird, 
and the wingtip paths are close to maximally separated. Birds flying 
directly behind another one in a streamwise location flap in spatial 
antiphase (Fig. 3d, e; see also Supplementary Figs 2b and 3b), poten- 
tially reducing the adverse effects of downwash (Fig. 3f), both in terms 
of magnitude and direction. If this position was aerodynamically 
adaptive, it would be predicted to be favoured at higher speeds, where 
parasite power is relatively high’*, compared with the induced power 
costs of weight support; forms of slipstreaming can reduce the drag 
experienced by followers***"’, even in cases where there is zero net 
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horizontal momentum flux in the wake (that is, drag = thrust)—as in 
steady swimming—owing to temporal or local spatial°”°’ fluctuations 
from mean wake conditions. Whether the position immediately 
behind is accidental or intentional, and whether it offers any aero- 
dynamic advantage or cost, is currently unclear. However, the wing- 
beat phasing observed when in this position would serve to displace the 
following bird’s wings from regions of greatest downwash (presumably 
immediately inboard of the trailing wingtip vortices, close to wingtip 
paths described by the previous bird), through most of the flap cycle. 

In transects both directly streamwise and along the favoured V 
position (Fig. 1c), temporal phase increases proportionally with dis- 
tance behind the focal bird (Fig. 3a, d), with a full 2m cycle change in 
phase over a complete wavelength; spatial phase is approximately 
maintained up to 4m behind the leading bird. Previously, there was 
much uncertainty about spatial wing-beat phasing and wingtip path 
coherence in flapping organisms. The only previous biological evid- 
ence of this phenomenon has come from tethered locusts, where dis- 
tance manipulations between a leading locust and a follower altered 
the phase patterns of their wing-beats****. Physical models also sup- 
port the potential for aerodynamic advantage due to phasing: appro- 
priate timing between tandem flapping in model dragonfly wings 
improves aerodynamic efficiency™*. Theoretical engineering models 
have taken into consideration flapping flight, and the extra benefits 
a flapping wing may accrue in formation flight'*'*. Such models have 
suggested that upwards of 20% variation exists in the induced power 
savings to be gained, if flapping is done optimally in spatial phase, 
compared with out of phase’ (Supplementary Fig. 4). 

Here we have shown that ibis flight in V formation does, on average, 
match predictions of fixed-wing aerodynamics (Fig. 1c, d), but that 
flock structure is highly dynamic (Fig. 2). Further, temporal phasing of 
flapping relates both to streamwise and to spanwise position. This 
indicates remarkable awareness of, and ability to respond to, the wing- 
path—and thereby the spatial wake structure—of nearby flock-mates. 
Birds flying in V formation flap with wingtip path coherence—the 
wingtips take the same path—placing wings close to the oscillating 
positions of maximal upwash. In contrast, birds flying in line flap in 
spatial antiphase—the wingtip paths are maximally separated—con- 
sistent with avoidance of adverse downwash. This raises the possibility 
that, in contrast to conventional aircraft, following birds may be able to 
benefit from ‘drafting’ while, to a certain extent, avoiding an increased 
cost of weight support by evading localized regions of downwash. 
Optimal flight speeds would differ between solo flight, V formation 
flight and (whether net-beneficial or not) in-line flight, potentially 
providing some account for the unstable, dynamic nature of V forma- 
tion flocks. 


METHODS SUMMARY 

Measurements. We equipped 14 juvenile northern bald ibises with back-mounted 
synchronized GPS (5 Hz) and inertial measurement units (300 Hz), mass 23 g 
(Supplementary Fig. 8), which were custom made in our laboratory, and tested 
and validated for accuracy and precision’*"*. At the start of migration, the mass of 
the birds was 1.30 + 0.73 kg, the 23 g loggers constituting approximately 3% of the 
body mass of the smallest bird. This is below the recommended 5% for flying 
animals”. The ibises formed part of a large-scale conservation programme. They 
had been hand-reared at Zoo Vienna (Austria), imprinted onto human foster 
parents and taught to follow a powered parachute (paraplane) to learn the migra- 
tion routes (Methods). Experimental protocols were approved by the Royal 
Veterinary College local Ethics and Welfare Committee. A GPS trace of the ibis 
flight imposed over Google Earth (Landsat) can be found in Supplementary Data 1 
asa KML file. GPS data were post-processed using GravNav Waypoint ™ software!>*, 
and inertial measurement unit data by custom-written MATLAB (R2012b, Math- 
works) programs'*’°. Mean flap frequency, speed and peak detection protocols are 
detailed in Supplementary Figs 5 and 6. For further details on post-processing, see 
Methods. 

Statistical analysis. Circular statistics'’ were done in LabVIEW (National Instru- 
ments). First-order (Rayleigh test) and second-order (Hodges-Ajne) statistics were 
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used to test the phasing of wing beats for significant deviations from random 
distribution. For further details on statistical analysis, see Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Birds. Northern bald ibises (G. eremita) (n = 14, five females and nine males) were 
hatched at Zoo Vienna, Austria, in March 2011, and imprinted immediately onto 
human foster parents (S.He. and D.T.). At 4 months of age, the birds began training 
flights behind a powered parachute (paraplane). Training flights lasted between 1 
and 4h, and were up to 5km in length. At the end of July, birds were fitted with 
dummy loggers to prepare them for being equipped with data loggers for the long- 
distance migratory flights. The mass of the birds at the start of migration was 
1.30 + 0.73 kg. As such, the 23 g loggers constituted approximately 3% of the body 
mass of the smallest bird. This is comfortably below the recommended 5% for flying 
animals**””. Experimental protocols were approved by the Royal Veterinary College 
local Ethics and Welfare Committee The loggers were externally attached, using 
Velcro and a harness (Supplementary Fig. 8). The dummy loggers remained on 
when birds were at rest in the aviary, which was at all times apart from the migratory 
flights. The first migratory flight began in August. The total migratory flight plan 
was from the training site near Salzburg, Austria (47.75377° N, 13.052959° E), to 
Orbetello, Italy (42.425484° N, 11.232662° E). Once en route, birds were flown, on 
average, every third day. During flights, the birds followed the paraplane, but were 
typically to the side of the vehicle, on average 147 m laterally, consistently to the left, 
except for one turn (see Supplementary Figs 5 and 6). All loggers functioned fully. 
The birds were flown early in the morning (7:00 departure): later flight times 
increased the occurrence of thermalling and gliding, resulting in the birds not 
following the paraplane sufficiently. A GPS trace of the full flight, imposed over 
Google Earth (Landsat), can be seen in Supplementary Data 1 (as a KML file). The 
recorded flight was the second stage of the migration. 

Data loggers. Further information about the loggers can be found in refs 15 and 
16. Briefly, GPS was recorded at 5 Hz and data were post-processed differentially 
over the short baseline between base station and ibises, using Waypoint GrafNav 
8.10. L1 coarse/acquisition (C/A) code pseudo-range measurements were used to 
calculate the position of each GPS logger, with velocity determined from L1 
Doppler measurements. Using this approach can provide positional accuracy to 
0.3 mand speed accuracy better than 0.1 ms’ '. Accelerometer data were recorded 
at 300 Hz. 

Initial data processing. The flight was checked for any periods when the birds had 
maintained periods of circling flight (note we do include one circle in our 
sequence) through examination of the GPS and accelerometer traces, and these 
sections were removed (less than 4 min of the total flight duration). The remaining 
flight, therefore, consisted of straight-line flight. The take-off and landing periods 
were removed, as, when taking off, it took approximately 4 min for the birds to 
form a coherent flock, and to follow the paraplane. Similarly, when the paraplane 
began to descend at the end of the flight, the birds separated and began to glide 
during descent. The position of the paraplane was recorded and tracked by a data 
logger (see Supplementary Figs 5 and 6). The GPS, recorded at 5 Hz, was inter- 
polated (MATLAB, R2012b, Mathworks) to the same sampling rate as the accel- 
erometer data, at 300 Hz. The interpolation replaced missing values in the GPS. 
GPS and accelerometer data were passed through a fourth-order Butterworth filter 
(MATLAB). To produce the histograms (Fig. 1b, c), the original GPS values were 
used after being interpolated to a constant 5 Hz sampling frequency. In Figs 1b and 
2, the colour scale refers to the duration a bird was present in each 0.25 m X 0.25m 
grid. For Fig. 1c, the colour scale refers to the number of flaps recorded in each grid. 
In Fig. 1c, the regional transect labelled ‘streamwise transect’ is offset because, for 
display and analysis, all data from the left side are mirrored to the right so all data 
points are on one side; thus the centre of the first sampled region lies 0.125m 
behind and to the right of the lead bird. Dorsal acceleration was used to determine 
each wing flap, and the upper reversal point”* of the flap cycle (see Supplementary 
Figs 5 and 6). Note that this reversal point in acceleration of the back need not 
relate to peak wing elevation—or indeed any particular wing kinematic—for the 
phasing analysis to function. 

Height. Height was recorded. The precision of height measurements, however, is 
lower than for horizontal positions”. This is because there were no satellites below 
the birds, and this geometry of the satellites caused a reduction in precision”. We 
do not consider vertical position because of the small ‘signal’ of interest (very slight 
vertical deflections) compared with the relatively high ‘noise’ (inevitable because of 
GPS satellite geometry). We chose a section where, according to the available error 
measurement calculated by Waypoint (see “Data loggers’), the height values were 
relatively consistent and that during this flight portion the birds were flying close 
to the same horizontal plane. 

Calculating flock formation and individual positioning. To establish position- 
ing of individuals and structure in the flock, a flock centroid was determined. To 
calculate the centroid of the flock, the MATLAB function ‘centroid’ was used. This 
function calculates the centroid of a polygon. The MATLAB centroid function 
treated each bird as a point ofa polygon, and determined the centroid for each time 
point. An average speed was calculated and any birds with a speed discrepancy 
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higher than 3ms~ | away from the mean flock speed were removed for that time 
point. From this, the resultant centroid was calculated now containing only birds 
close in position and speed. A rotation matrix was applied to the data to re-orient 
the heading so all birds were heading ‘up’, and the direction of the centroid was 
always in the positive “Y’ direction. The resultant matrix comprised a position for 
each bird for each sampling point. Theta (0), the angle between each bird and the 
lead bird, was calculated, transforming Cartesian to polar coordinates (cart2pole, 
MATLAB). For data presentation in the histograms (Fig. 1b), the field of view was 
set to 15m X 15m, and the area was divided into a 60 X 60 grid of bins (0.25 m 
X 0.25 m). Position 0/0 is the centroid. The heat histograms are shown as contour 
plots with five contour levels. 

During the 7 min section of V formation flight, individual birds showed a certain 
degree of positional infidelity in the V flock (Fig. 2; see also Supplementary Fig. 1 
and Supplementary Video 1). Although individuals contributed to the statistical V 
formation, their positioning was inconsistent. Certain individuals showed general 
preferences for a particular area in the V formation, whether left, right, front or 
rear, but the variability in positioning resulted in no clear leader in the flock (Fig. 2). 
Navigational ability and kin selection have been proposed as principal drivers of 
leadership in V formation flight*®, with more experienced birds or parents of a 
family group taking the lead*°. The ibis flock in the present study comprised birds 
of the same age (<1 yr old), with no previous navigational experience of the route 
and no parent-offspring relationships. The absence of immediate kin selection and 
learnt navigational ability as possible factors determining a V formation structure 
in the recorded flight strengthens the evidence for an aerodynamic function behind 
the V formation observed in the ibis. The young age of the birds, however, may be 
the main factor why there was a lack of a clear leader in the ibis flock, contrasting 
with previous observations of adult ibises, in which consistent leaders in flocks 
were identified*’. Spontaneous and inconsistent leadership has been identified in 
bird flocks either where no consistent social hierarchy exists**, or when no previous 
knowledge of a route is known”. For other ‘classic’ V formation fliers, the first migra- 
tion is a significant cause of mortality for young birds, even when migrating with 
parents. As such, aerodynamic mechanisms that reduce the energetic cost of (albeit 
only very infrequent) migratory flight may present considerable selection advantage. 
Movement in flock. Movement in the V formation was investigated by taking a 
45° line, the preferred angle for positioning with the V (Fig. 1c) as a transect from 
the apex of the V. The apex was determined by the intersection of two 45° lines, 
down each side of the V formation. For every bird for each time point, we mea- 
sured how far it was positioned from the 45° perpendicular transect line. For 
simplicity of analysis, all data were flipped (mirrored) so they could all be plotted 
against one 45° line. In Supplementary Fig. 1a, the red circles represent the original 
positions of the birds, for all birds and all times. From this, the shortest distance to 
the 45° line was calculated (blue line) and the position was projected on the 45° 
line; then the distance between projected position and green circle (the centroid) 
was calculated. The standard deviations are from the blue perpendicular line 
rather than the absolute distance, and represent how much the position varied 
with respect to the line. A mean (s.d.) was then calculated for each perpendicular/ 
parallel relationship (Supplementary Fig. 1b, c). The positioning of all individuals 
varied more along the line than out from the line (Supplementary Fig. 1c). If the 
changes in position were due only to error in logger measurement, the variation in 
perpendicular and parallel distance and position would be expected to be equal. 
Because most of the variation is present along the line, the variation can be con- 
fidently attributed predominantly to bird movement, not logger noise. 

Circular statistics and phasing analysis. Circular statistics were applied using 
LabVIEW (National Instruments), following that of Fisher’”***°, 

The relative positions (in the direction of flight) and phase relationships (as a 
proportion of the flap cycle of each ‘ahead’ bird) were determined for every bird 
following another individual. Determining appropriate independent sample cri- 
teria when considering phases is vital’’, and presents a challenge when analysing 
phase relationships. Consider the case of two birds flying at the same relative 
position and at the same frequency; they would maintain the same phase relation- 
ship indefinitely. Each flap would certainly not be considered an independent 
sample. As a conservative alternative, we take a mean phase for any bird-bird 
pairing for a given area to be an independent sample; no account is taken of the 
length of time or number of flaps spent in the area. Perversely, this technique 
actually makes use of the variability in relative position, and would be poor for 
absolutely rigid V formations. 

Statistical tests'’**°° analysed two regions, combining left and right sides: one 
representing V formation flight (from 0.49 to 1.49 m both spanwise and stream- 
wise), containing the highest density of flaps; the other for nose-to-tail, streamwise 
flight, covering a volume 0.25 m spanwise from midline (so 0.5 m behind) and 4m 
behind. This provided n = 165 and n = 160 bird-bird pairs for V formation and 
nose-to-tail regions, respectively. 
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The Rayleigh test was applied to determine the presence of a single unimodal 
direction in phase without preconceptions of any mean direction. This found a 
significant departure from randomness—a significant unimodal bias—in phase 
(whether temporal or spatial) for the V formation region. Both Rayleigh’s test 
(parametric) and Hodges—Ajne’s test'”** °° (non-parametric) on this region indi- 
cated that both the temporal and the spatial phases (taking into account the 
wavelength of whichever bird was ahead) were significantly different from those 
that would be found from a random distribution*”**. 

The median phase for a given region—and its 95% confidence intervals—allows 
a specified alternative to be tested against. Fig. 3a, d and Supplementary Fig. 3a, b 
show the median statistics in graphical form for the two regions. Zero or ‘in’ spatial 
phase falls outside the 95% confidence intervals for the nose-to-tail region. 

The median spatial phases for the two regions described above were used to 
predict the temporal phases for 0.25 m X 0.25 m along two streamwise transects 
using the wavelength measured for each volume along the transect. If the median 
spatial phase was m—out of phase, as it is close to in the nose-to-tail transect—we 
would predict it to be m every integer number of wavelengths, and 0 or ‘in’ 
temporal phase at '/2, 3/2, 5/2, etc. wavelengths. The model—with bounding 
confidence intervals due the spatial median—is shown as lines in Fig. 3a, d. 
Measured median temporal phases (+95% confidence intervals of the median) 
broadly match the predicted values (see also Supplementary Fig. 3a, b, which gives 
the same data in Cartesian form). Although the fit between model and observed 
temporal phases is visually convincing, formal statistical treatment is avoided 
because of uncertainty over independence between neighbouring spatial regions 
along the transects. 

Modelled induced flow behind flapping birds. The implications of flap phasing 
in terms of potential interaction with induced flows are shown in Fig. 3c, f. For this 
model, it is assumed that the wingtip vortex left behind a bird ahead (the grey bird) 
ofa follower (the black bird) follows the wingtip path through space—the convection 
of the vortex core (which, on average, will be inwards and downwards) is neglected”. 
Induced flow-fields are modelled following the Biot-Savart law*'’, treating the 
wingtip vortices as infinitely long, parallel filaments; no account is taken of vari- 
ation in lift throughout the wingstroke cycle. Induced flows near the vortex cores 
are not modelled; these regions are represented by grey circles. They, although 
being correct given the reductions and assumptions described, should not be taken 


as accurate quantitative calculations of the local flowfield. However, the principles 
they demonstrate—the strongest region of upwash and downwash close to out- 
board and inboard, respectively, of the wingtip path—meet basic aerodynamic 
expectations and recent modelling results***. For scale, the downwash directly 
between the vortices would be (—)0.3 ms | between trailing vortices for behind a 
bird of mass 1.3 kg, span 1.2 mata speed of 15ms_' (without modelling flapping 
or wake contraction). 


27. Barron, D.G., Brawn, J.D. & Weatherhead, P. J. Meta-analysis of transmitter effects 
on avian behaviour and ecology. Methods Ecol. Evol. 1, 180-187 (2010). 

28. Norberg, U. M. Vertebrate Flight: Mechanics, Physiology, Morphology, Ecology and 
Evolution Ch. 9, 118-132 (Springer, 2011). 

29. Kaplan, E. & Hegarty, C. Understanding GPS: Principles and Applications Ch. 7, 304— 
334 (Artech House, 2005). 

30. Andersson, M. & Wallander, J. Kin selection and reciprocity in flight formation? 
Behav. Ecol. 15, 158-162 (2003). 

31. Petit, D.R. & Bildstein, K.L. Development of formation flying in juvenile white ibises 

(Eudocimus albus). Auk 103, 244-246 (1986). 

32. Rands, S. A., Cowlishaw, G., Pettifor, R. A., Rowcliffe, J. M. & Johnstone, R. A. 

Spontaneous emergence of leaders and followers in foraging pairs. Nature 423, 

432-434 (2003). 

33. Biro, D., Sumpter, D. J. T., Meade, J. & Guilford, T. From compromise to leadership 

in pigeon homing. Curr. Biol. 16, 2123-2128 (2006). 

34. Mardia, K. & Jupp, P. Directional Statistics Ch. 6, 94-110 (Wiley, 1999). 

35. Sprent, P. & Smeeton, N. C. Applied Nonparametric Statistical Methods Ch. 4, 

83-122 (Taylor & Francis, 2007). 

36. Batschelet, E. Circular Statistics in Biology Chs 9, 15 (Academic, 1981). 

37. Wiltschko, W. et al. Lateralisation of magnetic compass orientation in a migratory 
bird. Nature 419, 467-470 (2002). 

38. Holland, R.A. etal. Testing the role of sensory systems in the migrating heading of a 
songbird. J. Exp. Biol, 212, 4065-4071 (2009). 

39. Hubel, T. Y. et al. Wake structure and wing kinematics: the flight of the lesser dog- 
faced fruit bat, Cynopterus brachyotis. J. Exp. Biol. 213, 3427-3440 (2010). 

40. Hubel, T. Y. etal. Changes in kinematics and aerodynamics over a range of speeds 
in Tadarida brasiliensis, the Brazilian free-tailed bat. J. R. Soc. Interface 9, 
1120-1130 (2012). 

41. Kroner, E. Dislocations and the Biot-Savart law. Proc. Phys. Soc. A 68, 53-55 
(1955). 

42. Griffiths, D. J. Introduction to Electrodynamics Ch. 5, 215 (Prentice Hall, 1998). 


©2014 Macmillan Publishers Limited. All rights reserved 


ae Ae dL Teas 


doi:10.1038/nature12788 


A mitochondrial genome sequence of a hominin from 


Sima de los Huesos 


Matthias Meyer", Qiaomei Ful?, Ayinuer Aximu-Petri', Isabelle Glockel, Birgit Nickel', Juan-Luis Arsuaga®"*, Ignacio Martinez”, 
Ana Gracia®”, José Maria Bermudez de Castro®, Eudald Carbonell”’® & Svante Piiibo! 


Excavations of a complex of caves in the Sierra de Atapuerca in 
northern Spain have unearthed hominin fossils that range in age 
from the early Pleistocene to the Holocene’. One of these sites, the 
“Sima de los Huesos’ (‘pit of bones’), has yielded the world’s largest 
assemblage of Middle Pleistocene hominin fossils”*, consisting of 
at least 28 individuals* dated to over 300,000 years ago’. The skel- 
etal remains share a number of morphological features with fossils 
classified as Homo heidelbergensis and also display distinct Neanderthal- 
derived traits**. Here we determine an almost complete mitochon- 
drial genome sequence of a hominin from Sima de los Huesos and 
show that it is closely related to the lineage leading to mitochondrial 
genomes of Denisovans”, an eastern Eurasian sister group to 
Neanderthals. Our results pave the way for DNA research on homi- 
nins from the Middle Pleistocene. 

The Sima de los Huesos site (see Fig. 1 for a map) is located at the 
foot of a 13 m vertical shaft, about 30 m below the surface and 500 m 
from the closest current entrance to the karst system’’. Humidity at the 
site is close to saturation, temperature in the cave is constant around 
10.6 °C and the fossils have been protected from major disturbances 
since deposition’’. The Sima de los Huesos is also noteworthy because 
it has provided unique evidence of long-term DNA survival. DNA 
preservation in the site was first proposed based on enzymatic amp- 
lification of a few short mitochondrial DNA (mtDNA) fragments from 
Middle Pleistocene cave bear remains'’. Recently, improvements in 
DNA extraction” and library preparation’? techniques for highly 
degraded ancient DNA have enabled the retrieval of a complete mito- 
chondrial genome of a cave bear (Ursus deningeri) found with the 
hominin remains in the cave’*. DNA preservation for hundreds of 
thousands of years has otherwise been documented only under perma- 
frost conditions'*”’. 


1 Sima de los Huesos 
2 Denisova 
3 El Sidron 
4 Valdegoba 
5 Les Rochers-de-Villeneuve 
6 La Chapelle-aux-Saints 
7 Scladina 
cal 


8 Engis 

9 Feldhofer 
10 Monti Lessini 
11 Vindija 
12 Mezmaiskaya 
13 Teshik-Tash 
14 Okladni 


Figure 1 | Location of the Middle Pleistocene site of Sima de los Huesos 
(yellow) as well as Late Pleistocene sites that have yielded Neanderthal DNA 
(red) and Denisovan DNA (blue). 


To investigate whether DNA may also be preserved in the hominin 
remains, we obtained several samples of bone, totalling 1.95g, by 
drilling holes into the breaks of a femur (Femur XIII, ref. 17) excavated 
in three parts, one in 1994 and the other two in 1999 (Fig. 2). DNA was 
isolated using a recently published silica-based method” and converted 
into 77 libraries for sequencing’®”* (Extended Data Table 1). Following 
library amplification, we first characterized a subset of the libraries by 
shallow shotgun sequencing on Illumina’s MiSeq platform (Extended 
Data Fig. 1). Overlapping paired-end reads were merged to reconstruct 
full-length molecule sequences and mapped against the human gen- 
ome using Burrows—Wheeler alignment (BWA)". For most libraries, 
fewer than 0.1% of the sequences could be confidently aligned to the 
human genome (Extended Data Table 2), but 21 libraries yielded pro- 
portions of aligned sequences that were high enough (between 0.1% 
and 8.4%) to investigate the frequencies of C to T substitutions at sequence 
ends, which are increased in authentic ancient DNA due to accelerated 
cytosine deamination in single-stranded overhangs” **. However, in 
no case did C to T substitution frequencies exceed 3% at 5’ ends and 
6% at 3’ ends (Extended Data Table 2), indicating that those libraries 
that are rich in human DNA are dominated by present-day human 
contamination. 

We next enriched all libraries for mtDNA, using a probe set based 
ona present-day human sequence. An initial inspection of the isolated 
sequences revealed the closest similarities to the mtDNA ofa Denisovan, 
an extinct archaic group related to Neanderthals’. Therefore, the lib- 
raries were additionally enriched with probes based on the Denisovan 
mtDNA”’. Sequencing was performed on Illumina’s HiSeq 2500 plat- 
form from both ends, and overlapping reads were merged and aligned 
to the human reference mtDNA. Sequences with identical start and 
end coordinates, which often represent amplification products of the 
same starting molecules, were fused to create consensus sequences, and 
sequences shorter than 30 base pairs (bp) were discarded. The enriched 
libraries yielded a sufficient number of mitochondrial sequences to 
estimate the frequencies of C to T substitutions. These varied widely 
among the libraries, ranging from 1% to 45% at 5’ ends, and from 2% to 
47% at 3’ ends (Extended Data Table 1). In agreement with the shotgun 


Figure 2 | Femur XIII reassembled from three parts after sampling. The 
natural fractures are visible in the proximal third of the femur. 
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Figure 3 | Patterns of cytosine deamination in the libraries constructed 
from the Sima de los Huesos hominin femur. a, C to T substitution 
frequencies are shown for the terminal positions of the aligned sequences for all 
sequences (black), those sequences carrying a C to T substitutions at their 5’ 


sequencing results, the libraries yielding the largest number of mito- 
chondrial sequences exhibited very low terminal C to T substitution 
frequencies (=3% and = 6% at 5’ and 3’ ends, respectively; Extended 
Data Fig. 2) indicating that they are dominated by present-day human 
contamination. Libraries showing C to T substitution frequencies of 
less than 5% at either end were considered to be too contaminated and 
therefore disregarded in subsequent analyses. 

Variation in C to T substitution frequencies among libraries suggest 
that two populations of sequences are present in the data, an endo- 
genous population strongly affected by cytosine deamination and a 
contaminating population showing much less deamination. To test if 
this is the case, we determined the 5’ C to T substitution frequencies 
for sequences showing a 3’ C to T difference to the reference and vice 
versa, thereby enriching for putatively endogenous DNA. C to T sub- 
stitution frequencies indeed increased to 55% at 5’ ends and 62% at 3’ 
ends, numbers that are close to those determined for the U. deningeri 
sample from Sima de los Huesos"* (Fig. 3a). Furthermore, stratification 
of the deamination signal by fragment length shows that the endogen- 
ous DNA is primarily present among sequences that are shorter than 
45 bp, again in agreement with the situation in the U. deningeri sample 
(Fig. 3b and Extended Data Fig. 3). Based on these results, we removed 
sequences longer than 45 bp and those that do not carry a terminal C 
to T substitution on either the 5’ or 3’ end (Extended Data Table 3). In 
addition, we applied a mapping quality filter to ensure unique place- 
ment of the sequences within the mtDNA genome and readjusted the 
alignment parameters to tolerate up to five C to T differences but no 
more than two other differences to the reference mtDNA sequence to 
discriminate against spurious alignments. Finally, T bases at the first 
and last three positions of each sequence were masked to reduce the 
impact of deamination-induced substitutions during consensus calling. 

We first called consensus bases for 15,181 positions of the mito- 
chondrial genome that were covered by 5 or more sequences of which 
at least 80% agreed. Average coverage across these positions was 21.8. 
However, such strict filtering increases the risk of ascertainment bias 
because residual modern human contamination as well as capture and 
mapping biases may lead to the exclusion of positions where the Sima 
de los Huesos specimen differs from the probes or the reference sequence. 
We therefore built a second more inclusive consensus by considering 
the three terminal positions while selecting sequences with C to T 
substitution and lowering the requirements for coverage and consensus 
agreement to 3 and >67%, respectively. This consensus encompasses 
16,302 positions or ~98% of the human mitochondrial reference gen- 
ome, with an average coverage of 31.6 (Extended Data Fig. 4). Third, 
to evaluate whether the use of Denisovan capture probes influence the 
results, we built a consensus using the strictest filtering criteria described 
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above, but including only sequences isolated with present-day human 
mtDNA probes (Extended Data Fig. 5). 

We reconstructed phylogenetic trees in a Bayesian statistical frame- 
work” using the three Sima de los Huesos consensus mtDNA sequences 
as well as the mtDNAs of present-day and ancient humans, Neanderthals, 
Denisovans, chimpanzees and bonobos. All three trees support a topology 
in which the Sima de los Huesos mtDNA shares a common ancestor 
with Denisovan mtDNAs to the exclusion of the other mtDNAs ana- 
lysed with maximum posterior probability (Fig. 4 and Extended Data 
Fig. 6). As expected owing to its age, the branch leading to the Sima de 
los Huesos mtDNA is shorter than those leading to any of the other 
archaic or present-day humans. Using 13 directly dated ancient mtDNA 
sequences for calibration” and the three consensus sequences, we esti- 
mated the age of the Sima de los Huesos specimen based on the length 
of its mtDNA branch (Table 1). These dates vary between 0.15 to 
0.64 million years with point estimates close to 400,000 years. This is 
in striking agreement with the point estimate of 409,000 years for the 
U. deningeri mtDNA”. We similarly estimated the divergence times of 
the major mitochondrial lineages (Table 1 and Extended Data Table 4) 
and find that the estimates for the divergence of the mtDNAs of the 


A Sima de los Huesos 


a Denisovans 


| Neanderthals 


Africans 


Asians and 
Europeans 


0.0090 


Figure 4 | Bayesian phylogenetic tree of hominin mitochondrial 
relationships based on the Sima de los Huesos mtDNA sequence determined 
using the inclusive filtering criteria. All nodes connecting the denoted 
hominin groups are supported with posterior probability of 1. The tree was 
rooted using chimpanzee and bonobo mtDNA genomes. The scale bar denotes 
substitutions per site. 
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Table 1 | Divergence times of the major hominin mtDNA lineages and the age of the Sima de los Huesos specimen as estimated by using three 


different filtering strategies for consensus calling 


Mitochondrial lineages 


Divergence dates and molecular age estimates in Myr BP (95% highest posterior density (HPD) intervals in brackets) 


Strict filters 


Strict filters enriched with human 
probes only 


Inclusive filters 


Sima de los Huesos age estimate 

Human — Neanderthal 

Sima de los Huesos/Denisova — Human/Neanderthal 
Sima de los Huesos — Denisova 


0.38 (0.18-0.60) 
0.51 (0.35-0.69) 
0.99 (0.70-1.33) 
0.73 (0.50-1.01) 


0.37 (0.16-0.59) 
0.53 (0.36-0.72) 
1.04 (0.72-1.41) 
0.76 (0.51-1.06) 


0.40 (0.15-0.64) 
0.44 (0.29-0.61) 
0.83 (0.54-1.11) 
0.65 (0.40-0.92) 


Sima de los Huesos hominin and Densiovans vary between 0.40 and 
1.06 million years with point estimates around 700,000 years ago. 

The fact that the Sima de los Huesos mtDNA shares a common 
ancestor with Denisovan rather than Neanderthal mtDNAs is unex- 
pected in light of the fact that the Sima de los Huesos fossils carry 
Neanderthal-derived features (for example, in their dental, mandibular, 
midfacial, supraorbital and occipital morphology*®””*). Denisovans 
were identified in 2010 based on DNA sequences retrieved from a 
manual phalanx and a molar found in southern Siberia””’. Based on 
analyses of their nuclear genome*"’ they are a sister group of Neanderthals, 
although the mtDNAs of Neanderthals and present-day humans share an 
mtDNA ancestor more recently with each other than with Denisovans”’. 
This may be owing to either incomplete lineage sorting in the common 
ancestral populations of these groups or to gene flow into Denisovans 
from another archaic group’. 

Several evolutionary scenarios are compatible with the presence of 
a mtDNA sequence that falls on the Denisovan mtDNA lineage in a 
~400,000-year-old hominin in western Europe. First, the Sima de los 
Huesos hominins may be closely related to the ancestors of Denisovans. 
However, this seems unlikely, because the presence of Denisovans in west- 
ern Europe would indicate an extensive spatial overlap with Neanderthal 
ancestors, raising the question how the two groups could genetically 
diverge while overlapping in range. Furthermore, although almost no 
morphological information is available for Denisovans, a molar that 
carries Denisovan DNA is of exceptionally large size? and does not 
exhibit the cusp reduction seen in the Sima de los Huesos hominins’. 
Most importantly, the Sima de los Huesos specimen is so old that it 
probably predates the population split time between Denisovans and 
Neanderthals, which is estimated to one-half to two-thirds of the time 
to the split between Neanderthals and modern humans, which is esti- 
mated to be 170,000 to 700,000 years ago’. Second, it is possible that the 
Sima de los Huesos hominins represent a group distinct from both 
Neanderthals and Denisovans that later perhaps contributed the mtDNA 
to Denisovans. However, this scenario would imply the independent 
emergence of several Neanderthal-like morphological features in a 
group unrelated to Neanderthals. Third, the Sima de los Huesos homi- 
nins may be related to the population ancestral to both Neanderthals 
and Denisovans. Considering the age of the Sima de los Huesos remains 
and their incipient Neanderthal-like morphology, this scenario seems 
plausible to us, but it requires an explanation for the presence of two 
deeply divergent mtDNA lineages in the same archaic group, one that 
later recurred in Denisovans and one that became fixed in Neanderthals, 
respectively. A forth possible scenario is that gene flow from another 
hominin population brought the Denisova-like mtDNA into the Sima 
de los Huesos population or its ancestors. Such a hominin group might 
have also contributed mtDNA to the Densiovans in Asia®’®. Based on 
the fossil record, more than one evolutionary lineage may have existed 
in Europe during the Middle Pleistocene’’. Several fossils have been 
found in Europe as well as in Africa and Asia that are close in time to 
Sima de los Huesos but do not exhibit clear Neanderthal traits. These 
fossils are often grouped into H. heidelbergensis, a taxon that is difficult 
to define®’*”’, particularly with regard to whether the Sima de los 
Huesos hominins should be included*. Furthermore, there may have 
been relict populations of still earlier hominins, notably those classified 
as Homo antecessor, which share some morphological traits with Asian 


Homo erectus*® and have been found just a few hundred metres away 
from Sima de los Huesos in Gran Dolina. 

Although nuclear sequence data are needed to clarify the genetic 
relationship of the Sima de los Huesos hominins to Neanderthals and 
Densiovans, the mtDNA sequence establishes an unexpected link between 
Denisovans and the western European Middle Pleistocene fossil record. 
Future efforts will now focus on describing the mtDNA variation of the 
Sima de los Huesos hominins and retrieving nuclear DNA sequences 
from them. The latter will be a huge challenge given that almost two 
grams of bone were required to generate the mtDNA sequence even 
though several hundred copies of mtDNA exist per cell. Although 
preservation of DNA for such long periods of time may be favoured 
by unique preservation conditions in the Sima de los Huesos, the 
present results show that ancient DNA sequencing techniques have 
become sensitive enough to warrant further investigation of DNA 
survival at sites where Middle Pleistocene hominins are found. 


METHODS SUMMARY 


A detailed description of the methods used for data generation and analysis is 
provided in the Methods section. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Perturbed neural activity disrupts cerebral 
angiogenesis during a postnatal critical period 


Christina Whiteus*, Catarina Freitas’ & Jaime Grutzendler!* 


During the neonatal period, activity-dependent neural-circuit 
remodelling coincides with growth and refinement of the cerebral 
microvasculature’”. Whether neural activity also influences the 
patterning of the vascular bed is not known. Here we show in neo- 
natal mice, that neither reduction of sensory input through whis- 
ker trimming nor moderately increased activity by environmental 
enrichment affects cortical microvascular development. Unexpectedly, 
chronic stimulation by repetitive sounds, whisker deflection or motor 
activity led to a near arrest of angiogenesis in barrel, auditory and 
motor cortices, respectively. Chemically induced seizures also caused 
robust reductions in microvascular density. However, altering neural 
activity in adult mice did not affect the vasculature. Histological 
analysis and time-lapse in vivo two-photon microscopy revealed that 
hyperactivity did not lead to cell death or pruning of existing vessels 
but rather to reduced endothelial proliferation and vessel sprouting. 
This anti-angiogenic effect was prevented by administration of the 
nitric oxide synthase (NOS) inhibitor L-NAME and in mice with 
neuronal and inducible NOS deficiency, suggesting that excessive 
nitric oxide released from hyperactive interneurons and glia inhib- 
ited vessel growth. Vascular deficits persisted long after cessation 
of hyperstimulation, providing evidence for a critical period after 
which proper microvascular patterning cannot be re-established. 
Reduced microvascular density diminished the ability of the brain 
to compensate for hypoxic challenges, leading to dendritic spine loss 
in regions distant from capillaries. Therefore, excessive sensorimotor 
stimulation and repetitive neural activation during early childhood 
may cause lifelong deficits in microvascular reserve, which could 
have important consequences for brain development, function and 
pathology. 

The development ofa cerebral microvascular network that precisely 
matches regional metabolic demands is crucial given the brain’s high 
energy consumption and susceptibility to ischaemia’. Although major 
cerebral vessels form during embryonic development, microvascular 
sprouting and pruning continue into the neonatal stages’, concurrent 
with synaptogenesis, axonal growth and gliogenesis. Common molecu- 
lar pathways regulate angiogenesis and axonal growth’, suggesting that 
coordinated mechanisms establish a microvascular network that meets 
the requirements of adjacent neural tissue. Although some studies 
suggest there is a link between neural activity and microvascular plas- 
ticity”, this remains controversial and it is unclear whether neural 
activity regulates vascular development or if angiogenesis follows an 
autonomous developmental program”. 

To address this question, we examined the effects of neural activity 
on cerebral microvascular development in neonatal mice. First, we 
reduced sensory input to the barrel cortex by bilateral whisker trim- 
ming for 10 days beginning at postnatal day 15 (P15). This reduces 
spiking activity and metabolism”, and affects dendritic spine dynamics’” 
in the barrel cortex. We quantified vascular branch points and total 
length from confocal images of various vascular markers (Supplemen- 
tary Fig. la-e and Supplementary Video 1) and found that this mani- 
pulation did not affect vascular density in the barrel cortex (Fig. laand 


Supplementary Fig. 2a, b). Moderate whisker stimulation by environmental 
enrichment over 10 days also had no effect on microvascular density 
(Fig. la and Supplementary Fig. 2a, c). Therefore baseline sensory 
activity does not modulate neonatal cortical angiogenesis. 

Unexpectedly, more persistent and repetitive activity led to reduced 
vascular density. Exposure to diverse tones, natural sounds and white 
noise over 10 h daily from P15 to P25 caused robust reductions in 
vessel branching and length (Fig. 1b, c and Supplementary Fig. 2a), 
which increased in magnitude when stimulation was extended (Sup- 
plementary Fig. 2d). This effect was specific to the stimulated region, as 
vascular density was reduced in the primary auditory cortex but not in 
the cingulate cortex (Fig. 1b). We then tested the effect of sustained 
whisker stimulation by performing unilateral whisker trimming and 
exposing mice to continuous air current. Daily 10-h stimulation for 8 days 
led to significant reductions in microvascular density of the barrel 
cortex corresponding to the stimulated whiskers (Fig. 1d and Sup- 
plementary Fig. 2a). Similarly, 3 h of daily treadmill running for 5 days 
reduced vessel density specifically in the motor cortex (Fig. le and 
Supplementary Fig. 2a). Interestingly, vascular reductions following 
auditory or whisker stimulations were most apparent in cortical layers 
2/3 and 4, whereas motor hyperactivity had a more significant effect in 
layers 5 and 6 (Supplementary Fig. 2e—g). This is likely to be due to the 
fact that sensory cortical layers 2/3 and 4 connect to afferent inputs 
from the thalamus that are most reliably activated following stimu- 
lation’’, whereas layer 5 efferent neurons in motor cortex are robustly 
activated during motor output". 

Over a 10-day period between P15 and P25, repetitive stimulation 
led to differences of up to 13% in branching and 8% in length between 
control and stimulated mice. This slight discrepancy is not surprising 
as most vessels formed postnatally are short capillaries that contribute 
substantially to branching but less so to length’. Although this decrease 
may seem modest, it represents a 70% reduction in the numbers of new 
vascular branches formed and a 80% reduction in length growth when 
compared to un-manipulated animals (Fig. 1f, g and Supplementary 
Fig. 2a). In contrast to neonates, chronic auditory stimulation of adult 
mice did not affect vascular density (Fig. 1b). This is likely to be due to 
the fact that brain endothelial proliferation and sprouting become very 
restricted after the postnatal period’. 

To test whether epileptiform activity also affects neonatal angiogen- 
esis, we administered the cholinergic muscarinic agonist pilocarpine 
for 10 days, which induced mild generalized seizures without tonic- 
clonic convulsions lasting for approximately 2 h (Supplementary Fig. 3). 
These seizures caused robust reductions of cortical vessel density in 
neonates (Fig. 1h and Supplementary Figs 2a and 4), but did not affect 
the adult microvasculature (Fig. 1h). We tested a second seizure mech- 
anism, independent of cholinergic stimulation by intracortical injec- 
tion of tetanus toxin, which blocks neurotransmission in inhibitory 
interneurons. This treatment also induced non-convulsive seizures, 
after which we observed significant vessel reductions in the cortex, 
contralateral (Fig. 1i) and ipsilateral (data not shown) to the injection. 
Botulinum toxin, a structurally similar molecule that blocks excitatory 
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Figure 1 | Increased levels of neural activity during postnatal development 
lead to reduced microvascular density. a, Cortical microvascular density is 
not affected by reduced neural activity caused by whisker trimming or moderate 
enhancement of activity by environmental enrichment. b-e, Prolonged and 
repetitive activity through auditory stimulation using a variety of tones and 
sounds (b, c), increased unilateral deflection of whiskers by continuous air flow 
(d; stimulated hemisphere compared to unstimulated hemisphere contralateral 
to whisker-trimmed side), and running on a treadmill (e), cause reduced vessel 
branching in auditory (A1), sensory barrel (SB), and motor (M1) cortices 


neurotransmission, had no effect on vessels, supporting the conclusion 
that only hyperactivity affects microvascular remodelling (Fig. li). 

To determine whether hyperactivity blocks vessel formation through 
stress or energetic depletion, we administered the glucocorticoid dexa- 
methasone, a model of stress, or the glucose analogue, 2-deoxyglucose, 
to impair glucose metabolism. However, neither of these treatments 
affected the cortical vasculature (Fig. 2a). Furthermore, hyperstimu- 
lation was not associated with cell death, changes in cell density, or 
microglia activation (Supplementary Fig. 5). This suggests that reduc- 
tions in vessel density are not secondary to homeostatic alterations, but 
a direct vascular response to hyperactivity. 

We further examined the effect of hyperactivity on endothelial pro- 
liferation by repeated bromodeoxyuridine (BrdU) administration and 
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respectively. Vessel density in control cortical areas (cingulate cortex, CC, and 
piriform, PIR) was unaffected. Auditory stimulation did not affect adult 
vasculature (b). f, g, Baseline cortical angiogenesis is robust between P15 and 
P25, as seen in representative images (g). h, i, Seizures caused by pilocarpine 
(h) or tetanus toxin (i) intracortical injections arrested vessel growth in 
neonates but not adults. Intracortical botulinum toxin injections caused no 
vessel changes (i). Scale bars, 200 um (c, g). m numbers are given in brackets. 
P values, one-tailed student’s t-test. Bars represent s.e.m. 


found that both auditory stimulation and seizures markedly decreased 
proliferation selectively in endothelial cells (Fig. 2b-f and Supplemen- 
tary Fig. 6). To determine how activity influenced vascular plasticity, 
we performed in vivo transcranial two-photon time-lapse imaging of 
mice expressing green fluorescent protein in their endothelium (Tie2- 
GFP). We found that administration of pilocarpine for 5 days caused 
90% fewer microvascular formations (Fig. 2g, h, j), resulting in a signi- 
ficant reduction in vascular length added (Fig. 2k). In contrast, the number 
of eliminated branches was not different from controls (Fig. 2g, i, j). These 
results demonstrate that perturbed activity does not cause vascular regres- 
sion, but instead arrests the proliferation and sprouting of new vessels. 

Nitric oxide (NO) is released during neural activation and is known 
to modulate angiogenesis’. We therefore asked whether NO is involved 
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Figure 2 | Neural hyperactivity reduces 
endothelial proliferation and new vessel 
formations in the neonatal cortex. a, Daily 
treatment with 2-deoxyglucose (2DG) or 
dexamethasone (DEX) does not affect vessel 
density. b, Auditory stimulation reduces 
endothelial cell (EC) proliferation, measured by 
BrdU, in primary auditory but not in cingulate 
cortex control area. Non-endothelial cell (NEC) 
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in the anti-angiogenic effects of neural activity. In the brain, NO is 
produced by three NOS isoforms: endothelial NOS (eNOS) and neur- 
onal NOS (nNOS), which are constitutively active in endothelial cells 
and neurons, respectively; and inducible NOS (iNOS), which increases 
during inflammation, but is also present at baseline in glia and neu- 
rons'*'®, Studies using NO donors or NOS overexpression have shown 
that NO is pro-angiogenic at moderate levels and anti-angiogenic at 
high physiological concentrations” **. Notably, we found that activity- 
mediated vessel reductions were completely prevented by administra- 
tion of a broad NOS inhibitor (L-NAME; L-NG-nitroarginine methyl 
ester), but not its enantiomer D-NAME (Fig. 3a, b). We then exposed 
isoform-specific loss-of-function mutants to auditory stimulation and 
found that mice lacking nNOS or iNOS were completely protected 
against activity-mediated vessel reductions, whereas eNOS-knockout 
mice were not (Fig. 3c, d). nNOS is expressed in subtypes of inhibitory 
interneurons”, which are over-activated during prolonged stimu- 
lation, and glial iNOS also responds to changes in neuronal activity’’. 
Therefore it is likely that activity-dependent NO production is respon- 
sible for the vascular effects observed (Supplementary Fig. 7 and Sup- 
plementary discussion 2). Although we found no changes in nNOS and 
iNOS protein levels following stimulation (Supplementary Fig. 8a), 
these isoforms produce NO very efficiently, therefore modest changes 
in their catalytic activity have strong effects on NO production. The 
anti-angiogenic effects of hyperactivity did not appear to be mediated 
by vascular endothelial growth factor (VEGF) because stimulation had 


in either location (see Supplementary Fig. 6). 

c, Pilocarpine injection cause decreased EC 
proliferation compared to saline-injected controls. 
NEC proliferation is not affected. d, Representative 
confocal image of cortical section stained with 
Glutl1 (endothelium, green), Pdgf receptor-B 
(Pdgfr; pericytes, red), and BrdU (nuclei, blue) 
shows dividing endothelial cells (white 
arrowheads) near a non-proliferating pericyte 
(yellow arrowhead). e, Collagen IV and BrdU 
double staining shows two proliferating EC nuclei 
(white arrowheads; surrounded on one side by 
basal lamina) and a non-proliferating pericyte 
(surrounded luminally and abluminally by basal 
lamina; yellow arrowhead). f, Orthogonal views of 
an endothelial cell within the collagen IV basal 
lamina. g, Image of a proliferating pericyte (note 
that proliferating pericytes are not surrounded on 
all sides by detectable basal lamina). h-l, In vivo 
two-photon imaging reveals that hyperactivity 
inhibits new vessel formations. Representative two- 
photon time-lapse images show eliminations (blue 
arrowheads) of a sprout (h) and a vessel (j), and 
formations (yellow arrowheads) ofa sprout (i) and 
a vessel (h) between time point 1 (left panel, P15) 
and 2 (right panel, P20). k, Percentage of vascular 
branches formed or eliminated during a 5-day 
treatment with either pilocarpine or saline. 

1, Significantly less vessel-length increase in 
pilocarpine-injected mice between P15 and P20 
than in saline-injected controls. Scale bars, 10 ym 
(d, e, g), 5 um (f), 50 um (h-j). P values, one-tailed 
student’s t-test. Bars represent s.e.m. 


no measurable effect on VEGF or VEGF-receptors levels (Supplemen- 
tary Fig. 8b-d). 

Our data show that perturbation of neural activity reduces vascular 
growth by 70-80% following activity paradigms and near 100% fol- 
lowing seizures. This growth arrest is likely to affect micro-regional 
tissue oxygenation because the brain, unlike other organs, has high 
energetic demand, minimal energy storage, and no un-perfused micro- 
vascular reserve’. To determine the effects of vessel reductions on brain 
homeostasis, we devised a method to spatially correlate regional tissue 
oxygenation with the relative proximity to capillaries using the tissue 
hypoxia probe pimonidazole. At normal oxygen levels, no pimonida- 
zole signal was observed in either control or hyperstimulated mice (Sup- 
plementary Fig. 9). Interestingly, when mice were exposed to 8% O} for 
1 h, we observed a greater pimonidazole signal in mice with reduced 
microvascular density caused by hyperactivity (Fig. 4a—d). Brain micro- 
regions distant from vessels experience lower oxygen tension at base- 
line and following neural activation”*™*. Consistent with this, in control 
mice we observed a trend towards increased pimonidazole signal in 
areas further than 10 1m from blood vessels. In hyperstimulated mice 
this difference was more pronounced, suggesting that reduced micro- 
vascular density impairs oxygen delivery to areas distant from capil- 
laries (Supplementary Fig. 10). 

To explore the effect of this diminished oxygen delivery on neuronal 
connectivity we measured dendritic spine density of layer V pyramidal 
neurons after exposure to moderate hypoxia (8% O, for 48 h). Mice 
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Figure 3 | Inhibition of neuronal and inducible nitric oxide prevents 
vascular growth arrest in response to activity. a, Mice treated with a NOS 
inhibitor (L-NAME) before daily audio stimulation sessions do not experience 
vessel reductions following auditory stimulation between P15 and P25; 
however, pre-treatments with D-NAME and L-arginine do not rescue vessel 
growth-arrest. b, Treatment with L-NAME, D-NAME and L-arginine in the 
absence of stimulation does not affect cortical vascular density. c, Mice deficient 
in nNOS or iNOS do not experience vessel growth-arrest following auditory 
stimulation. However, eNOS-deficient mice do experience growth arrest. 

d, Baseline vessel density of all three NOS-deficient strains is not altered 
compared to wild type. P values, one-tailed student’s t-test. Bars represent s.e.m. 


that had previously been exposed to repetitive auditory stimuli experi- 
enced a reduction in spine density in auditory cortex, specifically in 
areas distant from capillaries, whereas unstimulated mice did not 
experience spine loss (Fig. 4e, f). This demonstrates that the observed 
microvascular reductions can cause synaptic loss after a mild reduction 
in environmental oxygen and suggests that other metabolic challenges 
or increases in demand may have similar effects. In addition to changes 
in spine number, our results suggest that impaired oxygen delivery to 
intercapillary regions may be involved in the alterations in cortical 
tonotopic mapping and auditory discrimination observed after repet- 
itive auditory stimulation in neonates’. 

We next sought to determine whether microvascular reductions 
were long-lasting by exposing mice to auditory stimulation and asses- 
sing vessel density at various time intervals. Neonates given a 5-day 
auditory stimulus completely regained normal vascular density 1 month 
later (Fig. 4g), probably because hyperstimulation was stopped before 
the end of the first postnatal month, when substantial angiogenesis is 
still ongoing’. However, when auditory stimulation was extended to at 
least 15 days, vessel density did not return to normal even after 5 months 
(Fig. 4h). This indicates the existence of a critical period during which 
the vasculature can recover from anti-angiogenic intervention, but 
after which deficits become permanent. Therefore a temporary exposure 
to hyperactivity in neonates causes permanent alterations in microvas- 
cular architecture, leading to deficits in oxygen delivery and impaired 
neuronal connectivity (Supplementary Fig. 11). 

Our study shows that modest alterations in baseline levels of neural 
activity do not affect vascular patterning, suggesting that in the post- 
natal period brain angiogenesis follows an autonomous developmental 
program. Unexpectedly, this program can be disrupted by increased 
levels of nitric oxide released from neurons and glia following repet- 
itive sensory-motor stimulation or seizures. This effect is probably 
maladaptive, given that such repetitive activity patterns are not likely 
to have been prevalent through evolution. However, these findings 
raise the concern that early childhood seizures” or exposure to repet- 
itive auditory and other sensory-motor stimuli, which are common in 
modern society”, could have lifelong repercussions on the cortical 
microvasculature, its oxygen delivery capabilities, and the homeostasis 
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Figure 4 | Activity-mediated microvascular deficits are long-lasting and 
affect brain oxygenation and dendritic spine stability in areas distant from 
capillaries. a—d, Activity-mediated vessel reductions lead to increased levels of 
hypoxia both in auditory-stimulated (a) and pilocarpine-injected (c) neonatal 
mice. Collagen IV vessel label (green) and pimonidazole tissue hypoxia label 
(red) in primary auditory cortex (A1) of control and auditory-stimulated 
mice (a). Averaged distribution of pimonidazole pixel intensities shows 
brighter pixels in the stimulated group (b) (the average percentage pixels at each 
pixel intensity were compared between animals for each treatment group using 
a t-test; n = 5 control, n = 4 auditory stimulated). c, Representative images 
from the cingulate cortex (CC) of saline and pilocarpine-injected mice. 

d, Distribution of pimonidazole brightness shows increased labelling in mice 
stimulated with pilocarpine (n = 5 animals per treatment group). AU, arbitrary 
units. e, f, Dendritic spine density in layer 2/3 and 4 of auditory cortex in Thyl- 
YFPh mice, expressing fluorescent protein in layer 5 pyramidal neurons, was 
quantified using high-resolution confocal images. Spines per unit length (jum) 
of dendrite were calculated in areas closer and further than 20 um from the 
nearest vessel. e, Left panel, projecting dendrites (green) and vessels (white). 
Right panel, a dendrite studded with spines. f, Control and audio-treated mice 
were left in normoxia or exposed to 8% oxygen for 48 h and spine density was 
quantified. g, h, Vascular deficits following a 5-day auditory stimulation are 
fully recovered 1 month after the stimulus ended (g); however, auditory 
stimulation for at least 15 days leads to permanent vessel loss up to 5 months 
post stimulation (h). Scale bars, 200 jim (a, c), 100 um, inset 50 jim (e). 

Bars represent s.e.m. 


of neural cells. In addition, it may make the brain vulnerable to con- 
ditions of reduced oxygen supply or microvascular pathology such as 
hypertension, diabetes and ageing. 


METHODS SUMMARY 


Neonatal or adult mice were exposed daily to various sensorimotor stimulation 
paradigms (auditory stimulation, whisker deflection, treadmill exercise or chem- 
ically induced seizures) for various time intervals. High-resolution confocal micro- 
scopy of vascular and proliferation markers or in vivo two photon microscopy in 
mice expressing GFP in endothelial cells was performed to quantify vascular 
density, proliferation, and rates of vessel formation and elimination. Reduction 
of nitric oxide synthase (NOS) was achieved by injection of L-NAME or in mice 
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lacking nNOS, iNOS or eNOS. Measurement of the effects of microvascular 
reductions on tissue oxygenation was accomplished by microregional densito- 
metric analysis of pimonidazole fluorescence using custom-made Image]-based 
macros. Dendritic spines were quantified in regions near or far from microvessels 
using confocal microscopy of mice expressing yellow fluorescent protein (YFP) 
under a neuronal promoter. A full description of methods is available in the online 
version of the paper. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Mice. Wild-type mice (Charles River), Tie2-GFP endothelial reporter (Jackson 
Laboratory no. 003658),Thy1-YFP (Jackson Laboratory no. 003782), neuronal 
nitric oxide synthase knockdown mice (Jackson Laboratory no. 002986), inducible 
NOS knockout mice (Jackson Laboratory no. 002609), and endothelial NOS 
knockout mice (Jackson Laboratory no. 002684) were on a C57BI6 background. 
Mice between postnatal age P5 and P200 were used. In all experiments littermates 
were assigned to treatment or control groups and equally distributed by sex and 
bodyweight (although neither of these factors influence vessel density; Sup- 
plementary Fig. 12d, e). Pups were housed jointly with their mother except for 
the duration of stimulation or injections. For adult experiments 9-week-old 
females were used. Sample sizes for each experiment were chosen based on the 
size of the litter (typically n = 5 per group) and in some cases repeated to confirm 
with different litters. Experimental protocols were in accordance with the relevant 
guidelines and regulations of the Institutional Animal Care and Use Committee at 
Northwestern University and Yale University. 

Treadmill exercise, auditory and whisker stimulations. In our exercise experi- 
ments, pups starting at age P15 were placed in individual lanes on a custom-made 
treadmill to induce moderate running speeds (15 cm s_!) over 45 min, 3 times 
daily for 5 days. 

For auditory experiments, pups were separated from their mothers and exposed 
overnight to 10 h of a variety of sounds (40-75 dB) consisting of white and pink 
noise, tones over a range of frequencies, frequency sweeps, recordings of rodent 
vocalizations, and other natural sounds. Littermate controls were also separated 
from their mothers and kept in a quiet room. Stimulation began as early as P15 and 
lasted 5, 10 or 15 days. For long-term experiments, mice were stimulated between 
15 and 30 days and then returned to their home cages for 1, 3 or 5 months before 
being euthanized. 

To stimulate whiskers unilaterally, unilateral whisker trimming was performed 
daily and mice were subsequently placed into a cage with a continuous airflow for 
10h over 8 days. Vessel density in the stimulated barrel cortex (contralateral to the 
stimulated whiskers) was compared to the unstimulated (trimmed) cortical region. 
Pilocarpine-induced seizures. Pilocarpine was administered intraperitoneally 
(72 mg kg *, P6503, Sigma) as early as P5. This dosage is substantially lower than 
that used to induce chronic epilepsy’. Starting at P7 mild seizures were observed 
following injection. These were characterized by reduced exploratory behaviour, 
excessive salivation, forelimb clonus and occasional head bobbing, which ceased 
after 2 h. Mice received daily injections over 10 days. No evidence of spontaneous 
seizures was noted during this period. 

Intracortical tetanus and botulinum toxin injections. For intracortical injec- 
tions, a craniotomy is made over the cingulate cortex of anaesthetized mice by 
thinning the skull with a dental drill and using a fine needle to remove a small (100 um 
diameter) piece of skull. Using a fine glass capillary mounted to a stereotaxic appar- 
atus, tetanus toxin*' (25 ng; 190A, List Biological Labs) or botulinum toxin (0.03 ng; 
128A, List Biological Labs) was diluted in 7 l of artificial cerebral spinal fluid 
(ACSF) and was injected 500 um deep into the cingulate cortex. A CamKII-Tomato 
adeno-associated reporter virus was added to the injection mix to demarcate the 
injected area for later relocation. An equivalent volume of ACSF with virus was 
injected into control mice. Mice experienced spontaneous seizures, demonstrated 
by head bobbing and facial spasms a few days after injection, which continued to 
occur until they were sacrificed 10 days later. 

Models of impaired metabolism and stress and nitric oxide manipulations. 
Starting at P15 mice were given 10 daily intraperitoneal injections of either 2- 
deoxyglucose (2-DG, 10 mg kg '; D8375, Sigma), which impairs glucose meta- 
bolism, or the glucocorticoid dexamethasone (5 mg kg~ 1 p4902, Sigma), which is 
a widely used model of stress. To manipulate nitric oxide levels, the nitric oxide 
inhibitor L-NAME (50 mg kg” ',N5751, Sigma), the nitric oxide substrate L-arginine 
(50 mg kg *, A5006, Sigma), and the L-NAME enantiomer D-NAME (50 mgkg’ *, 
N4770, Sigma) were diluted in saline and injected intraperitoneally 30 min before 
onset of each stimulation session. 

In vivo transcranial two-photon time-lapse imaging. Cerebral blood vessels 
were imaged in Tie2-GFP endothelial reporter mice. Neonatal time lapse imaging 
was performed on the two-photon microscope as previously described’. In brief, 
mice were anaesthetized with isoflurane and the skull was exposed with a midline 
scalp incision. A 1-mm-diameter skull region over the somatosensory cortex was 
thinned with a microsurgical blade to a final thickness of approximately 30 um. 
The surrounding skull was attached to a custom-made steel plate to stabilize the 
head while imaging. A CCD camera image of skull blood vessels allowed for re- 
location of the imaging area. The first time point was acquired at P15, and then 
mice received daily intraperitoneal injections of pilocarpine or saline for 5 days. 
One day after the final injection, mice were prepared for imaging and blood vessels 
were re-located and imaged for a second time point. 


Hypoxia stress test. Mice were exposed to 15 days of auditory stimulus or 10 days 
of pilocarpine seizures. Mice were then injected intraperitoneally with Hypoxyprobe-1 
(pimonidazole Hcl 50 mg kg '; HP1-1000Kit, Hypoxyprobe) to label hypoxic 
tissue. Note that the threshold for pimonidazole detection is <10 mm Hg tissue 
O2 and may therefore not detect mild O, reductions. Ten minutes after injection of 
the probe, mice were placed into a hypoxic chamber set at 8% oxygen for 1 h. Mice 
were anaesthetized, while still in the hypoxic chamber and, when they were no 
longer responsive, they were removed and immediately euthanized by transcardial 
perfusion. 

Imaging of dendritic spines in hypoxic brains. Dendritic spines were visualized 
in Thyl-YFPh-line reporter mice, which express yellow fluorescent protein in 
layer V pyramidal projection neurons. Mice were given 10 days of auditory stimu- 
lation and then placed in a hypoxic chamber at 8% for 48 h. Control littermates 
received no stimulation but were exposed to hypoxia. Animals were injected 
intravenously with NHS biotin to label vessels and sacrificed by transcardial per- 
fusion. In a separate group of mice, animals were exposed to auditory stimulation 
or control conditions and immediately sacrificed to determine baseline changes in 
spine density. 

Tissue collection and histology. Mice were anaesthetized by intraperitoneal 
injection with ketamine-xylazine (120 mg ml _':10 mg ml” ') and euthanized by 
transcardial perfusion using 4% paraformaldehyde (PFA). Brains were post-fixed, 
bathed in sucrose and cut on the cryostat into 50-11m coronal sections. For western 
blot analysis, fresh cortices were dissected from the rest of the brain under a dis- 
secting microscope and flash frozen. For brain weight measurements, un-perfused 
brains were extracted and immediately weighed. 

Vessels were stained in fixed tissue using collagen IV antibody (1:250; -ab19808, 
Abcam), Glut-1 (1:100; 2186307, Millipore) or isolectin B4 (1:50; B1205, Vector 
Labs) as previously described’. Additional vessel labelling was achieved by trans- 
cardial injection of fluorescein-tomato-lectin (200 pl; FL1171, Vector Labs) or 
NHS-biotin (300 mg kg *; 2027, Thermo), which required a conjugated strepta- 
vidin counterstain. 

For BrdU labelling, mice were injected intraperitoneally every second day with 
BrdU (5 mg kg’ *; B5002, Sigma). Brain sections were washed, denatured with 5M 
HCl for 15 min and incubated with rat anti-BrdU (1:400, OBT0030, AbDSerotec). 

Staining for other cell types was performed using the following antibodies: 
NeuN (1:250; MAB377, Millipore) to label neurons, IBA1 (1:200; 019-19741, 
Wako) for microglia, PdgfrB (1:100; AF1042, R&D) for pericytes, VEGF (1:100; 
ab46154, Abcam). To label VEGF pathway components VEGER2 (1:1,000; 55B11 
Cell Signaling), Phospho-VEGFR2 (Tyr 1175) (1:1,000; 19A10, Cell Signaling), 
Phospho-VEGFR2 (Tyr951) (1:1000; 7H11, Cell Signaling), and VEGFR1 (1:1000; 
ab32152, Abcam) were used. Hypoxyprobe-1 (1:50, HP1-1000Kit, Hypoxyprobe) 
was used to detect injected pimonidazole, and c-Fos (1:100; F7799, Sigma) was 
used to detect immediate early protein detection. DAPI (1:1,000; 9542, Sigma) was 
used to quantify cellular densities. 

A TUNEL detection kit (11684817910, Roche) and cleaved caspase-3 antibody 
(1:100, Millipore; 559565, BD Pharmingen) were used to label dying and apoptotic 
cells. DNase treatment served as a positive control for TUNEL staining and a 
stroke induced by microembolism with 50-,tm beads injected into the carotid 
artery was a positive control for both TUNEL and caspase-3 labelling. AlexaFluor 
antibodies 488 and 555 (Life Sciences) were used throughout. For all experiments, 
sections from both groups were stained simultaneously, using the same batch of 
antibodies. 

VEGF protein levels were measured with mouse VEGF Quantikine Elisa Kit 

(MMV00, R&D) according to the manufacturer’s instructions using tissue from 
mice treated with pilocarpine or saline between P15 and P25 and assayed in 
triplicate. 
Area selection and image acquisition for vascular quantification. Vascular 
density varies greatly by brain region, therefore the Paxinos and Allen brain atlas 
were used to carefully identify and locate cortical areas. For each area analysed, a 
standard coronal section, containing the relevant cortical region, was found. Three 
consecutive sections were selected from each mouse brain region and standardized 
regions were located bilaterally using the atlas (six regions of interest were obtained 
for quantification in each brain, see Supplementary Fig. 12a). To ensure uniformity 
of the sections, special care was taken that the angle of cryo-cutting was standard 
for all brains and cutting was performed during a single session. 

Primary auditory cortex, sensory barrel cortex and primary motor cortex were 
quantified for auditory, whisker and motor experiments, respectively. Control 
areas were always chosen from the same section: cingulate cortex used as a control 
for auditory and whisker experiments and the piriform cortex used for motor 
stimulation. For pilocarpine seizures, 2DG-, and dexamethasone-treated brains, 
the cingulate cortex was quantified. 

Using a Leica 0.8 NA 20% lens, an image of the entire cortical region of interest 
was obtained on a confocal microscope (Leica SP5) at X1 zoom, 1 |tm step size, 
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1,024 X 1,024 resolution, and a scan speed of 600 Hz. Imaging for an individual 
experiment was performed in one session, with uniform laser intensity and gain 
parameters used between control and experimental groups. 

Vessel density quantification and laminar layer determination. For quantifica- 
tion, vessels were labelled with isolectinB4, NHS-biotin or collagen IV. Z-projections 
were made from a fixed number of optical sections. All quantifications were carried 
out blind. Vascular length was quantified automatically using a custom-made 
ImageJ (NIH) macro in which vessels were thresholded to create a binary mask, 
despeckled, and skeletonized. Skeletonized vessels were used to obtain vessel 
length in microns (Supplementary Fig. 12a). Blinded manual counts of blood vessel 
branches were performed using Image J Cell Counting software (Supplementary 
Fig. 12b). To ensure accurate counts, six cortical regions and more than 2,000 
branches were counted in each animal. To quantify individual laminar layers, we 
used previously published measurements of cortical layers in barrel, auditory, and 
motor cortices**** to divide our images into the relevant laminae. 

Other cellular quantifications. We have previously shown that endothelial BrdU 
can be accurately separated from non-endothelial BrdU as well as peri-vascular 
BrdU using confocal images’. For BrdU, high-resolution confocal images were 
obtained using a Leica 1.3 40x HCLX PL APO oil immersion lens in the cingulate 
cortex for pilocarpine-treated mice, and in the auditory and cingulate cortex for 
auditory-stimulated mice. Z-projections of a standardized thickness were made 
and BrdU positive endothelial and non-endothelial cells were counted on Image] 
Cell Counting software. 

Quantification of total cell number (DAPI), neuron, and microglia number was 
performed in the cingulate cortex using ImageJ Cell Counting software. All man- 
ual counting was carried out blind. 

For co-localization of neurons with c-Fos, neurons and c-Fos images were 
separated and quantified individually on ImageJ Cell counter so that the percent- 
age of c-fos positive neurons could be calculated. 

Quantification of microvasculature in vivo. In vivo two-photon image stacks 
were cut to a standard thickness of 150 jum and vessels were quantified blind. To 
measure total vessel length, we used the program Fiji Simple Neurite Tracer, which 
allowed us to trace individual vessels in three dimensions. Total vessel length was 
determined for first and final time points and per-cent difference was calculated. 

Image] was used to quantify elimination and formation of branches in vivo by 
comparing first and second time points. New branches consisted of new sprouts 
(counted as one new branch), sprouts which had anastomosed to form a connected 
vessel (counted as one new branch), and vessels which became fully connected at 
the second time point (counted as two new branches). Eliminated branches con- 
sisted of fully formed vessels that had retracted such that only a sprout remained 
(counted as one eliminated branch), fully formed vessels which retracted comple- 
tely (counted as two eliminated branches), and sprouts which retracted completely 
(counted as one eliminated branch). Change in branches was determined by 
counting all branch points at time point 1 and then counting branches that had 
been added or lost at time point 2. Per-cent change in branch formation and 
elimination were calculated by dividing the number of new or lost branches by 
the total number of branches at time point 1. The observed vessel changes were 
evenly distributed across the entire tissue volume 
Statistics. For each mouse, vessel and cell-density quantifications were obtained 
from both hemispheres of three coronal sections (six images in total) and were 
averaged to obtain a single value. 

Although there is strong inter-litter consistency in vessel density, there are small 
but statistically significant differences in density between litters. Therefore all of 
our comparisons are carried out within litters (both controls and experimental 
animals belong to a single litter). However, as most experiments use multiple 
litters, we use a normalization method to add together different litters. In each 
litter, the experimental group is normalized so that the control group averages to 
100%, and following this normalization, multiple litters can be combined. Density 
values are distributed normally and a one tailed student’s t-test assuming equal 
variance was used to compare groups. Spine density and hypoxia were compared 
between areas near and far from vessels, the stimulated and unstimulated hemi- 
spheres were compared in whisker-stimulated mice, and all other comparisons 
were performed between control and experimental groups. 
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Tissue-hypoxia quantification. Sections were co-stained with collagen IV and 
pimonidazole antibodies, and images were obtained on Leica HyD detectors, con- 
focal photon-counting mode using a 1.0 NA 20X lens at X1 zoom, 1 jum step size, 
1,024 X 1,024 resolution, and a scan speed of 600 Hz. Primary auditory cortex was 
imaged following auditory treatment and cingulate cortex was used for pilocarpine 
seizure treatments and images were projected to uniform thickness (5 jim). 

We designed an ImageJ macro to quantify pimonidazole staining. In this, blood 
vessels are thresholded and a mask is created and projected over the pimonidazole 
staining so that only pimonidazole signal in non-vascular areas is quantified. A 
histogram of the pimonidazole staining is obtained which shows the distribution 
of the brightness of individual pixels. We use 8-bit images, therefore brightness is 
distributed over 256 bins, 0 representing no detectable signal and 256 representing 
the brightest. Because the non-vascular area may vary, brightness values were 
converted to percentages of the total number of pixels counted. Values for each 
brightness bin (0-256) are averaged across six images for each mouse, to obtain a 
single average histogram per animal. Group averages for each brightness bin are 
obtained by averaging the brightness values of individual animals and one-tailed 
t-tests are performed in each brightness bin to compare mice in different experi- 
mental groups. 

To determine the differences in pimonidazole brightness near and far from 
capillaries, a histogram representing tissue closer than 10 1m to vessels was com- 
pared to areas more than 10 jim away from the nearest vessel. This was done 
automatically using a custom-made ImageJ macro. In this macro a vessel mask was 
created and a histogram of total pimonidazole staining was obtained. Then the 
vessel mask was dilated to 10 jm to obtain a second histogram in the area further 
from vessels. This second histogram was subtracted from the initial histogram to 
obtain a value for the area closer than 10 1m to vessels. 
Dendritic-spine-density quantification. NHS biotin-injected brains were coun- 
terstained with fluorescently labelled streptdavidin to label vessels and sections 
were imaged on the confocal microscope using a Leica 1.4 NA 63% lens at x4 
zoom at 1 jm step size. Sixteen images (1,024 X 1,024 pixel each) were acquired to 
cover layers 2/3 and 4 of the auditory cortex and stitched together to reconstruct 
the dendritic projections. Tiled stacks were Z-projected to a standardized thick- 
ness (5 um) for quantification and four to six images were quantified for each 
animal. Using a custom-made Image] macro, vessels were thresholded and a mask 
was created. The vessel mask was dilated to 20 jm, to create a selection which 
outlined the area 20 jim from the nearest vessel. This selection was projected over 
the image of the dendrites to create a border between areas ‘near’ and ‘far’. 

Images were quantified blind to treatment. In each image we excluded dendrites 
that were out of focus or displayed weak YFP labelling, such that individual spines 
could not be identified. The remaining dendrites were traced to determine their 
total length in areas ‘near’ or ‘far’ from vessels. Spines were counted manually 
using Image] Cell Counter and density ‘near’ and ‘far’ from vessels was determined 
by dividing by the length of the dendrites quantified. An average density ‘near’ and 
‘far’ was determined for each animal and group averages ‘near’ and ‘far’ were 
compared using a student's t-test. 
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Respiratory surfaces are exposed to billions of particulates and 
pathogens daily. A protective mucus barrier traps and eliminates 
them through mucociliary clearance (MCC)'?. However, excessive 
mucus contributes to transient respiratory infections and to the 
pathogenesis of numerous respiratory diseases’. MUC5AC and 
MUCSB are evolutionarily conserved genes that encode structur- 
ally related mucin glycoproteins, the principal macromolecules in 
airway mucus’*. Genetic variants are linked to diverse lung dis- 
eases* °, but specific roles for MUC5AC and MUCSB in MCC, and 
the lasting effects of their inhibition, are unknown. Here we show 
that mouse Muc5b (but not MucSac) is required for MCC, for 
controlling infections in the airways and middle ear, and for main- 
taining immune homeostasis in mouse lungs, whereas MucSac is 
dispensable. Muc5b deficiency caused materials to accumulate in 
upper and lower airways. This defect led to chronic infection by 
multiple bacterial species, including Staphylococcus aureus, and to 
inflammation that failed to resolve normally’. Apoptotic macro- 
phages accumulated, phagocytosis was impaired, and interleukin- 
23 (IL-23) production was reduced in Muc5b~’~ mice. By contrast, 
in mice that transgenically overexpress Muc5b, macrophage func- 
tions improved. Existing dogma defines mucous phenotypes in 
asthma and chronic obstructive pulmonary disease (COPD) as driven 
by increased MUCS5AC, with MUCSB levels either unaffected or 
increased in expectorated sputum’*. However, in many patients, 
MUCS5B production at airway surfaces decreases by as much as 
90%°""'. By distinguishing a specific role for Muc5b in MCC, and 
by determining its impact on bacterial infections and inflammation 
in mice, our results provide a refined framework for designing 
targeted therapies to control mucin secretion and restore MCC. 
Mucosal surfaces are central interfaces between organisms and their 
external environments. Mucus-coated barriers defend against pathogens” 
and re-distributed commensal organisms’’. Gastrointestinal mucins pre- 
vent Helicobacter pylori growth”, colitis’* and colorectal carcinogenesis’». 
To test whether secreted airway mucins serve correspondingly import- 
ant roles, we examined MCC and responses to bacterial infections in 
MucSac’~ mice (ref. 16), Muc5b~’~ mice, and lung specific-Muc5b- 
overexpressing transgenic (Tg(Scgblal-Muc5b)) mice (Extended Data 
Fig. la—d). We identified unique mechanisms by which Muc5b mediates 
effective respiratory mucosal defence (Extended Data Fig. 2). In 
Muc5b ‘~ upper airways, olfactory gland glycoconjugates were absent, 
but nasopharyngeal surfaces were unaffected (Fig. 1a, b). Muc5ac and 
Muc5b were lacking in respective knockout airways, but tracheobronchial 


glycoconjugates increased in Muc5b~'~ mice owing to induced MucSac 
(Fig. 1c, d and Extended Data Figs 1f and 3). Despite retaining mucous 
phenotypes in many airway tissues, growth and survival were impaired in 
Muc5b '~ animals (Fig. le, f), whereas MucSac ’~ and Scgblal-Muc5b 
mice survived normally (Fig. 1f). Acute MCC was normal in MucSac /~ 
and Scgblal-Muc5b mice, but severely reduced in Muc5b-‘~ mice 
(Fig. 1g, h and Extended Data Fig. le), even though functional ciliated 
cells were present (Fig. 1d, i). Mucus transport was impaired in Muc5b/~ 
tracheal epithelial cells in vitro, confirming that defective clearance 
reflected altered mucociliary interactions specifically (Fig. 1j-] and 
Supplementary Videos 1 and 2). Collectively, these data identify non- 
redundant protective requirements for Muc5b in survival and MCC. 

Impaired MCC in Muc5Sb ‘~ mice was accompanied by abnormal 
breathing (Fig. 2a, b) and hypoxaemia (Fig. 2c). We assessed lung function 
in mechanically ventilated mice. Baseline airflow and responses to the 
bronchoconstricting agent methacholine were normal in the lower airways 
(Fig. 2d—f). Circumventing the upper airways restored ventilation in spon- 
taneously breathing trachestomized animals (Fig. 2g and Supplementary 
Videos 3 and 4). Thus, upper respiratory obstruction impeded airflow 
in Muc5b ’~ mice. Micro-computed tomography (micro-CT) confirmed 
this with radiological evidence of upper airway obstruction (Fig. 2h), 
and middle-ear effusion consistent with otitis media (Extended Data 
Fig. 4a, b). The latter was unexpected given associations between increased 
MUCSBand human otitis media !’. In Muc5b~’~ mice, but not Muc5ac /~ 
mice (data not shown), hair fragments encased in mucus-like material 
were consistently found in posterior nasopharynxes (Fig. 2i, j and 
Extended Data Fig. 4c) and middle ears (Fig. 3a and Extended Data 
Fig. 4d). Bacteria and inflammation in Muc5b-‘~ middle-ear lavage 
samples confirmed infectious otitis media (Fig. 3b, c). 

In Muc5b ‘~ lower airways, aspirated materials and inflammatory 
infiltrates were also common (Fig. 3d, e). Culturable bacteria in the 
lungs increased 2.9-21.6-fold over time, and 7.6-75-fold further in 
spontaneously moribund mice who also had increased bacteria in 
spleen cultures (Fig. 3f, g), suggesting that disseminated infections con- 
tributed to mortality. To test this hypothesis, mice were placed on 
antibiotic-supplemented diets. Antibiotics reversed spontaneous leth- 
ality and reduced lung bacterial burden, but did not restore normal 
ventilation in Muc5b-’~ mice (Fig. 3h-j). Thus, the cause of death in 
Muc5b ’~ mice was infectious, and not directly due to airflow limita- 
tion. These data identify a natural course in which MucSb deficiency 
causes upper airway obstruction, spontaneous infection of connecting 
auditory tubes and the lungs, and fatal bacteraemia. 
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Because of the importance of bacterial infection in Muc5b‘~ mice, 
we identified organisms whose accumulation in the lungs was norm- 
ally prevented by Muc5b-rich mucus. 16S rRNA phylotyping revealed 
19 genera and >25 species in the lungs. Most were oral, gut and skin 
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Fig. 7a-e). Thus, Muc5b is selectively required for preventing acquisi- 
tion and enrichment of virulent bacteria such as S. aureus in the 
airways. 
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Figure 2 | Muc5b deficiency causes severe upper airway obstruction. 

a, b, Airflow and ventilation in Muc5b*’* and Muc5b /~ mice. c, Oxygen 
saturation (SaO2) and rescue with 100% O, supplementation (fraction of 
inspired oxygen (FiO,), 1.0). d-f, Lung (R;,) and lower airway (Raw) resistance 
and hyperresponsiveness to methacholine (MCh) in mechanically ventilated 
mice. g, Upper airway bypass by tracheostomy (trach.) under normoxia (FiO, 
0.21). h, Upper airway obstruction in Muc5b ‘~ mice confirmed by micro-CT 
(red arrows, nasopharynxes). i, j, Hair encased in mucus-like plugs visible 
during necropsy. Scale bars, 1 mm. Error bars, s.e.m. *P < 0.05. Numbers in 
brackets, n mice. 
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Figure 3 | Infection is the cause of death in Muc5b ‘~ mice. a-c, Hyperaemic 
blood vessels (red arrows), hair (white arrows), bacteria and leukocytes (>95% 
neutrophils) in Muc5b~‘~ middle ears. d, e, Hair fragments and leukocytes 
in Muc5b-’~ lungs. f, g, Chronically accumulating bacteria in lungs, leading to 
moribundity and septicemia in Muc5b ’~ mice. h-j, Mortality, infection and 
ventilation with antibiotic treatment. k, Bacterial genera, and staphylococcal 
species in M uc5b /~ mice.1, Survival following intranasal S. aureus inoculation 


The airways are exposed to bacteria from birth onwards. MCC is 
considered the major route for their elimination. However, although 
Muc5b ‘~ mice showed severely impaired MCC, most survived several 
months with increased lung bacterial burden (Figs 1f and 3f, h), but 
were ultimately unable to control S. aureus infection (Fig. 3k, 1). Therefore, 
we investigated additional potential mechanisms for Muc5b-dependent 
airway defence. Purified MUC6 (but not MUCS5AC) inhibits H. pylori 
growth”, and purified MUCSAC (but not MUC2) inhibits Trichuris 
muris survival'®. However, neither MUC5AC nor MUCSB inhibited 
S. aureus growth (Extended Data Fig. 7f), suggesting that MucSb- 
mediated defence was not mediated by direct inhibitory interactions 
with S. aureus. 

We then investigated whether Muc5b deficiency affected inflam- 
matory defences. Leukocytes in Muc5b ‘~ lung lavage fluid were sig- 
nificantly altered. In the steady state, neutrophils and eosinophils 
accumulated, but lymphocytes were absent (Extended Data Fig. 8a). 
There were also marked changes in macrophages in Muc5b ’~ lungs. 
Macrophages accumulated over time (Fig. 4a). They frequently con- 
tained undigested cytoplasmic inclusions (Fig. 4b), and had impaired 
phagocytic functions (Extended Data Fig. 9a). By age 12 months, IL-23, 
a mediator of anti-microbial inflammatory responses normally pro- 
duced by macrophages and dendritic cells, was reduced by 93% in 
Muc5b~’~ mice (Fig. 4c and Extended Data Table 1). As Muc5b /~ 
mice became moribund over this timecourse, dysfunctional inflam- 
matory phenotypes progressed (Fig. 4d and Extended Data Fig. 8a). 
Macrophages that accumulated in airspaces were increasingly apoptotic 
(Fig. 4e, f). IL-23 remained significantly lower in moribund Muc5b/~ 
mice than controls (Fig. 4g), despite their severely infected states 
(Fig. 3f, k). We found no evidence that macrophages produce Muc5b 
(Extended Data Fig. 9b), so it is unlikely that these pathologies were cell 
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(10’ CEU per mouse). Actino, Actinobacillus sp.; E. coli, Escherichia coli; Lacto, 
Lactoobacillus sp.; Rx, antibiotic treatment; S. aureus, Staphylococcus aureus; 
S. hominis, Staphylococcus hominis; S. lentus, Staphylococcus lentus; S. sapro, 
Staphylococcus saprophyticus; Staph, Staphylococcus; Strep, Streptococcus; 

S. xylosus, Staphylococcus xylosus. Scale bars, 20 1m (d, e). Error bars, s.e.m. 
*P < 0.05. Numbers in brackets, n mice. 


autonomous. Rather, the ability of Muc5b /~ lungs to resolve inflam- 
mation following microbial exposure was significantly inhibited’. 

Apoptotic macrophage accumulation in Muc5b ‘~ mice suggests 
that in addition to acute microbial elimination, MCC is required for 
chronically maintaining phagocyte clearance and immune functions 
in vivo. We therefore tested the impacts of age and Muc5b expression 
on acute and resolving lung inflammation following S. aureus infec- 
tion. In young animals (age 3 months), acute neutrophilic responses 
were similar irrespective of genotype (Extended Data Fig. 8b). By 
contrast, ageing Muc5b ’~ animals (age 6 to 7 months) were exqui- 
sitely sensitive, exhibiting 100% mortality within the first few hours 
of infection, whereas Muc5b‘’* and Scegblal-Muc5b mice survived 
(Fig. 4h). Thus, the combined absence of MCC and the chronic effects 
of underlying spontaneous infections enhanced the deleterious con- 
sequences of S. aureus infection. Young mice survived to day 7 post 
infection, a peak time point of resolving inflammation’. Infected 
Muc5b ‘~ lungs exhibited increases in lung macrophages and mixed 
granulocytes (Fig. 4h and Extended Data Fig. 8c). Accumulated macro- 
phage pools were predominated by apoptotic cells (Fig. 4i) and linked 
to significantly reduced lung IL-23 (Fig. 4j), similar to spontaneously 
moribund Muc5b ‘~ mice. Conversely, in Segblal-Muc5b mice, IL-23 
production, macrophage activation, and S. aureus elimination were 
enhanced (Fig. 4j-1). 

Collectively, these data show that Muc5b expression affects inflam- 
mation and macrophage-mediated responses in the lungs. In its 
absence, mice failed to mount effective anti-bacterial responses. In its 
sustained presence, these were enhanced. Apoptotic cells, including 
neutrophils, accumulated in MCC deficient Muc5b~’~ airspaces 
(Fig. 4e and Extended Data Fig. 9c). Efferocytosis of neutrophils is 
reported to downregulate IL-23 (ref. 19), which may explain its decline 
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Figure 4 | Muc5b maintains 
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in the lungs. It will be of great interest to test the degrees to which 
Muc5b-dependent changes in macrophages are mediated by altered 
elimination owing to impaired MCC, and by other processes such as 
opsonization or pharmacologic interactions”. Nevertheless, these 
studies functionally link two fundamental first line host defence com- 
ponents; mucus and phagocytes. To our knowledge, Muc5b is the only 
bronchial secretory protein identified so far that demonstrates such 
singularly strong biological necessity. Mice survive normally in the 
absence of MucSac or anti-microbial peptides such as B defensin-1 
(ref. 21), calgranulin B”, lipocalin-2 (ref. 23), or lactoferrin™*. In addi- 
tion to controlling MCC and macrophage readiness, it is conceivable 
that Muc5b is a scaffold for these and other defensive molecules. 

The novel relationships between Muc5b, MCC and inflammation 
reported here have the potential to impact therapies for airway dis- 
eases. Cystic fibrosis, COPD and asthma are associated with mucus 
hypersecretion, impaired MCC, and increased risk for respiratory 
infections***’. MUCSAC and MUCS5B expression varies among indi- 
viduals, but the overall result is excessive mucin production'*""'. Such 
pathological associations have overshadowed the benefits of airway 
mucus and the potential functions of individual mucins in infec- 
tions’’*?>">1°°, Human MUCSB is highly polymorphic. A recently 
identified promoter variant found in approximately 20% of the general 
population increases its expression 37.4-fold in healthy lungs’. Our 
findings in Muc5b-deficient and Muc5b-overexpressing mice suggest 
that MUCS5B variants may regulate airway homeostasis, disease patho- 
genesis, and mucosal immune function in humans broadly. Although 
controlling mucin hypersecretion is an attractive therapeutic goal for 
transient and chronic lung diseases’, a therapy that completely dis- 
rupts mucus and inhibits MUC5B for extended periods may not be 
advisable. Instead, promoting adequate expression and enhancing 
MCC and airway defence in controlled manners are better strategies. 


METHODS SUMMARY 


Mice were used with University of Colorado, University of North Carolina Chapel 
Hill, and University of Texas M.D. Anderson Cancer Center Institutional Animal 
Care and Use Committee approvals. Muc5ac /~ mice were generated as described 
previously'®. Muc5b ’~ and Scgb1al-Muc5b mice were generated as part of the 
current studies. Muc5b protein was assessed immunohistochemically using rabbit 
polyclonal antisera. Ciliary beat, MCC, and transport were assessed as described 
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previously. Lung function was measured using a head-out plethysmograph and a 
flexiVent (Scireq), and blood oxygen was assessed using a pulse oximeter. Otitis 
media was assessed by visual otoscopy and middle ear lavage (MEL). Pulmonary 
inflammation was assessed by histology and lung lavage. Lavaged leukocytes were 
identified by light microscopy and flow cytometry. Neutrophils, macrophages, 
MHC-II, and apoptotic cells, were detected using commercially available antibodies 
and reagents. S. aureus was administered by 10 jl intranasal or 50 pl intratracheal 
inocula at 10’-10° colony-forming units per animal. Bacteria and bacterial DNA 
were isolated from MEL, lung homogenates and lung lavage pellets. Isolated col- 
onies were phylotyped by 16S ribosomal RNA sequencing. Kaplan-Meier (Figs 1f 
and 3h, 1), regression (Figs le and 2f), one-sided t-test (Figs 1g-i, k, 1, 2b-e, g, 3b, c, f, 
g, j, kand 4c, d, f, g, i, j), and one-way analysis of variance (ANOVA) (Figs 3i and 
4a, h, j, 1) with appropriate corrections for multiple comparisons, unequal variances, 
and non-Gaussian distribution were carried out using GraphPad Prism v5.04 
(GraphPad Software). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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How biological systems generate reproducible patterns with high 
precision is a central question in science’. The shoot apical meristem 
(SAM), a specialized tissue producing plant aerial organs, is a devel- 
opmental system of choice to address this question. Organs are peri- 
odically initiated at the SAM at specific spatial positions and this 
spatiotemporal pattern defines phyllotaxis. Accumulation of the plant 
hormone auxin triggers organ initiation” °, whereas auxin depletion 
around organs generates inhibitory fields that are thought to be suf- 
ficient to maintain these patterns and their dynamics** *. Here we 
show that another type of hormone-based inhibitory fields, gen- 
erated directly downstream of auxin by intercellular movement 
of the cytokinin signalling inhibitor ARABIDOPSIS HISTIDINE 
PHOSPHOTRANSFER PROTEIN 6 (AHP6)", is involved in regu- 
lating phyllotactic patterns. We demonstrate that AHP6-based fields 
establish patterns of cytokinin signalling in the meristem that con- 
tribute to the robustness of phyllotaxis by imposing a temporal 
sequence on organ initiation. Our findings indicate that not one 
but two distinct hormone-based fields may be required for achiev- 
ing temporal precision during formation of reiterative structures 
at the SAM, thus indicating an original mechanism for providing 
robustness to a dynamic developmental system. 

In the most widely accepted theory for phyllotaxis, spatiotemporal 
patterns of organ initiation at the SAM result from the combined effect 
of inhibitory fields produced by existing organs. As these organs are 
displaced away from the SAM by growth, new organs form sequentially 
at positions where the sum of the inhibitory effects is the lowest**. Both 
the position of new organs and the time delay between organ initiations 
(or plastochron) are emergent parameters of this dynamical system®. 
Strong evidence indicates that, notably through a network of polarly 
localized PIN-FORMED 1 (PIN1) efflux carriers, polar auxin transport 
not only controls accumulation of auxin at the site of organ initiation 
but also creates inhibitory fields around organs by auxin depletion*”"’. 
It has therefore been proposed that the auxin transport system could 
be sufficient to control both the spatial and the temporal dynamics 
of phyllotaxis’. 

Here we re-evaluated this proposition by considering the role of 
cytokinin during organ initiation in Arabidopsis thaliana. Cytokinin 
regulates the size of the stem cell niche (and thus of the SAM) and this 
can effect phyllotaxis by modifying the geometry of the SAM**"*. To 
explore a possible role of cytokinin directly in organ initiation, we re- 
analysed recent transcriptomic data for different domains of the SAM” 
and identified AHP6 (ref. 14) as the only candidate gene encoding a 
cytokinin signalling effector specifically enriched in organs (Fig. 1a, 
Supplementary Figs 1 and 2a). Using in situ hybridization, we con- 
firmed that AHP6 is specifically expressed during organ initiation and 


development (Fig. 1b, c, Supplementary Fig. 2b-h)'*. Wild-type Arabidopsis 
plants display spiral phyllotaxis, resulting in consecutive organs gen- 
erally distributed on the stem at a divergence angle close to a 137.5° cano- 
nical angle (noted as a; Fig. 1d). In contrast, both ahp6-1 and ahp6-3 null 
mutants as well as the ahp6-1/ahp6-3 (ahp6-1/3) trans-heterozygote 
showed obvious modifications in organ arrangements along the stem 
(Fig. le-g). We also observed supernumerary petals and sepals in flowers 
(Supplementary Fig. 3), indicating that AHP6 regulates phyllotaxis 
throughout inflorescence development. 

To further characterize the stem architecture of ahp6 mutants, we 
analysed sequences of successive divergence angles between organs on 
the inflorescence stems from a large population of ahp6-1 and wild-type 
plants. This analysis demonstrated a notable amount of non-canonical 
divergence angles in wild-type plants and a large increase in the occur- 
rence of such angles in most mutants (Supplementary Fig. 4a, b). Nota- 
bly, an ‘M-shaped’ motif corresponding approximately to the angle 
sequence 24, 360—«, 20 appeared much more frequently in ahpé than 
in wild-type sequences (Fig. 1h, iand Supplementary Fig. 4c-f). It was 
not associated with changes in the structure of the stem, such as twist- 
ing, that could modify angles between siliques (Supplementary Fig. 5). 
This motif can theoretically arise if two consecutive organs in a canonical 
sequence are permuted along the stem (Fig. 1j)’’. By applying a stochastic 
and a combinatorial model to analyse the divergence angle sequences 
(see Methods), we showed that over 95% of the non-canonical angles 
can indeed be explained by permutations of the insertion order of 2 to 
3 organs in both wild-type and ahp6 plants (Fig. 1k, land Supplementary 
Fig. 4c-f). We further demonstrated an increase by 2.4-fold and 17.6- 
fold of permutations involving 2 and 3 organs, respectively, in ahp6 
mutants compared to wild-type plants (Fig. 1k). Altogether, our data 
indicate that AHP6 is required for buffering an intrinsic instability of 
phyllotaxis leading to permutations in the order of organ insertions 
along the stem. 

We next used scanning electron microscopy (SEM) to study the 
geometry of the ahp6 mutant SAM. In wild-type meristems the spatial 
organization of the organs usually followed the expected phyllotaxis 
and consecutive organs showed clear differences in size (Fig. 2a). In 
contrast, pairs or triplets of young organs at quasi-identical develop- 
mental stages occurred in most ahp6 meristems (Fig. 2b). This obser- 
vation was confirmed using a LEAFY (LFY) promoter driving GFP 
(pLFY::GFP) flower-specific marker line (Fig. 2c, d), demonstrating that 
the loss of AHP6 leads to simultaneous development of flowers instead 
of the mostly sequential outgrowth observed in wild-type meristems. 
As the size of the stem-cell niche and of the meristem were not signifi- 
cantly affected by the ahp6 mutation (Supplementary Fig. 6), these data 
indicate that AHP6 is required for regulating the sequence of organ 
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Figure 1 | AHP6 is expressed in lateral organs and regulates inflorescence 
architecture. a, AHP6 (AT1G80100) expression in genome-wide data for 
lateral organs (FILp), stem cells (CLV3p) and the WUSCHEL domain 
(WUSp)”. An asterisk indicates statistically different from FILp (Methods). 
Dashed line indicates non-specific background level. Expression measured 
in arbitrary units (a.u.). b, c, AHP6 in situ hybridization on section (b) 

and whole-mount tissues (c). d-g, Wild-type (d, Col-0) and ahp6 

(e-g) inflorescences. Arrowheads (d-g) indicate perturbations. 

h, i, Representative angle sequences for wild type (h, n = 82) and ahp6-1 

(i, n = 89). Asterisk indicates M-shaped motif. j, Explaining the M-shaped 
motif: canonical insertion order (left), the one obtained by permutating two 
organs (centre) and angle sequences (right). k, Quantification of permutations. 
Wild type (WT). 1, Permutations in the ahp6-1 sequence from (i). 


initiation and/or organ growth following initiation to restrict concom- 
itant development of organs. 

To discriminate between these possible scenarios, we followed the 
expression of a nuclear-localized DR5:: VENUS? over several days in 
ahp6 and wild-type meristems. The synthetic auxin-inducible DR5 
reporter allows monitoring of the recruitment of organ founder cells 
starting from initium il and in primordia from early P1 onwards (Sup- 
plementary Fig. 7)*°""”°. Live imaging of DR5:: VENUS demonstrated 
that, although the mean rate of organ initiation (the mean plastochron) 
is comparable in wild-type and ahp6 (Supplementary Fig. 8), the loss of 
AHP6 results in a strong increase in concomitant organ initiation and, 
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at a very low frequency (8 out of 255 initiation events), in permutations 
in the order of organ initiation (Fig. 2e-g and Supplementary Fig. 9). 
On the contrary, the relative angle between organs and the position of 
their initiation relative to the centre of the meristem were unaffected by 
the ahp6 mutation (Fig. 2h, i). In addition, co-initiated organs were also 
generated at a similar radial distance from the centre independently of 
the genotype and with a standard deviation smaller than one cell dia- 
meter (Supplementary Fig. 10a)*!. The rate of organ displacement away 
from the centre of the SAM was also similar between ahp6 and wild- 
type, indicating that AHP6 does not influence growth (Supplementary 
Fig. 10b). Thus, although organs are initiated at a precise radial dis- 
tance and with a precise divergence angle, frequent organ co-initiations 
are observed, indicating an irregular plastochron. AHP6 is then required 
to stabilize the plastochron by limiting organ co-initiations. Our data 
further indicate that organ permutations on the stem result mostly from 
organ co-initiations. Also, the frequency of perturbations in the sequence 
of organ initiation at the SAM is higher than the frequency of permuta- 
tions on the stem in both wild-type and ahp6 (28% and 47% compared 
to 10% and 25% for wild-type and ahpé, respectively). This indicates 
that co-initiated organs are sorted when the internode is established 
either in a normal or in an inverted order, only the latter resulting in 
permutations on the inflorescence stem. 

As auxin activates directly AHP6 transcription in root tissues”, we 
next investigated whether the spatiotemporal pattern of AHP6 tran- 
scription in the SAM could be controlled by auxin. Co-visualization of 
a pAHP6::GFP transcriptional reporter that recapitulates AHP6 expres- 
sion pattern (Fig. 3a) and DR5:: VENUS showed that AHP6 is activated 
one plastochron after DR5 activation (Fig. 3b-d and Supplementary 
Video 1). AHP6 expression in the SAM was also lost in monopteros 
(mp), a mutant in a major transcriptional effector of auxin signalling in 
the SAM (Fig. 3e)”*. In addition, using electrophoretic mobility shift 
assays (EMSAs), we mapped binding of MONOPTEROS to three out 
of six locations of putative ARF binding sites in the AHP6 promoter 
(Fig. 3f). This indicates that AHP6 is activated downstream of auxin by 
MONOPTEROS. 

The temporal delay between DR5 and AHP6 activation, together with 
the fact that AHP6 is required for the earliest steps of organ initiation, 
indicates that AHP6 acts non-cell-autonomously on the temporal sequence 
of organ initiation. We thus monitored the distribution in the SAM of 
a functional AHP6-GFP protein fusion expressed under the endogen- 
ous AHP6 promoter™. We observed sharp gradients of AHP6-GFP 
centred on primordia and extending beyond their boundaries, indica- 
ting that intercellular movement of the protein creates fields of AHP6 
around organs (Fig. 3g). AHP6—-GFP fluorescence quantification further 
demonstrated that AHP6 movement creates a non-cell-autonomous 
differential in AHP6 levels between the predicted il and i2 sites (Fig. 3h, i; 
Supplementary Fig. 11), AHP6 levels being 1.47-fold higher (+0.32 
with n = 12 meristems; t-test: one-sided P = 2 X 10 *) at the i2 site. 
Fluorescence profiles taken through il or i2 from the two closest pri- 
mordia (P3 and P5 or P2 and P4, respectively) further indicate that the 
proximity of P2 and to a lesser extent P4 allows for the higher AHP6 
level at the i2 site (Fig. 3j and Supplementary Fig. 12). These profiles also 
allowed visualization of changes in AHP6 levels produced by prim- 
ordia; AHP6 levels increase first and then decrease strongly from P4 
onwards (Fig. 3h, j). AHP6 levels at il are then lower owing not only to 
an increased distance between il and P3 (P2 being further away), but also 
to lower AHP6 levels produced by P5. Thus, the differential in AHP6 
levels between the il and i2 sites, that could also be visualized in plants 
co-expressing AHP6-GFP and DR5::VENUS (Fig. 3k), results from 
both the geometry of the SAM and dynamic changes in AHP6 levels 
during flower primordia development. AHP6 movement could be blocked 
by fusing AHP6 to a triple- VENUS (3 VENUS; Fig. 31 and Supplemen- 
tary Fig. 13a-c). The AHP6-3 X VENUS protein expressed under the 
endogenous AHP6 promoter (pAHP6::AHP6-3X VENUS) was still func- 
tional because it could complement the cell-autonomous loss of proto- 
xylem phenotype in ahp6 roots (Supplementary Fig. 13d-g)'*. However, 
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morphologically identical stages. c, d, pLFY::GFP expression in wild type 

(c) and ahp6-1 (d). Numbers within the brackets indicate number of cells. 
e-i, Organ initiation timing and position in wild type (n = 20) and ahp6-1 

(n = 33) expressing DR5::VENUS. Representative time courses (e, wild type; 


in contrast to pAHP6::AHP6- VENUS, pAHP6::AHP6-3X VENUS could 
not complement the ahp6-1 inflorescence phenotype (Fig. 3m). This 
suggests that AHP6 movement in the SAM generates a differential in 
AHP6 levels between il and presumptive i2, which is required for 
sequential organ initiation. Note that a narrow region with a lower level 
of AHP6 was often observed in the vicinity of the expected i2 position, 
although the proximity of P2 and P4 limited its size compared to il 
(Fig. 3h, j and Supplementary Fig. 12). Thus, slight errors in i2 position- 
ing can almost abolish the i1/i2 AHP6 differential. This could explain 
the occurrence of co-initiations even in wild-type meristems. 

As AHP6 functions as a cytokinin signalling inhibitor in the root", 
its spatial distribution in the SAM could create differential cytokinin 
signalling capacities between il and presumptive i2. Indeed, comparison 
of wild-type, ahp6 mutants and 35S::AHP6 plants (Fig. 4a) showed that 
AHP6 levels negatively modulate the induction by cytokinin of several 
primary cytokinin response genes’***”* in shoot tissues. Expression of 
the cytokinin-inducible synthetic reporter TCS::GFP”° was also extended 
in ahp6 meristems (Fig. 4b, c), indicating that AHP6 negatively regulates 
the spatial distribution of cytokinin signalling in the SAM. In addition, 
we found that TCS::GFP and DR5::VENUS were activated together 
during organ initiation in partly overlapping domains in wild-type plants: 


Divergence angle 


f, ahp6-1); organ co-initiations/permutations (g); relative angles (h); 

and radial position of initiation (i). Primordia (P) numbered from youngest to 
oldest, initia (i) from oldest to youngest. Asterisks indicate co-initiated organs. 
Autofluoresence visible in red (c, d) or grey channels (e, f). Scale bars, 

50 pm (a-d) or 20 um (e, f). 


TCS::GFP expression was absent from presumptive i2 but started to be 
expressed at low levels in il and increased in P1 onwards, thus demon- 
strating a progressive activation of cytokinin signalling (Fig. 4d, f-h, 1). 
In ahp6 meristems, TCS::GFP was on the contrary already expressed in 
i2 at a level similar to that observed in P1 in wild type and increased 
further compared with wild type from il onwards (Fig. 4e, i-k, 1), indi- 
cating that AHP6 protein distribution regulates the spatiotemporal 
pattern of cytokinin signalling during organ initiation. Our results thus 
show that AHP6 distribution in the SAM creates a differential in cyto- 
kinin signalling between il (higher cytokinin signalling) and presump- 
tive i2 (lower cytokinin signalling). 

The negative correlation between AHP6 protein levels and DRS levels 
in il and presumptive i2 suggests that the differential in cytokinin sig- 
nalling generated by AHP6 could act by modulating PIN-regulated 
auxin transport, as observed in root tissues’”*®*’, However, we could 
not detect any changes in PIN] levels or intracellular localization or in 
PIN] polarity distribution in the SAM of ahp6 mutants (Supplementary 
Fig. 14). This indicates that AHP6 is unlikely to affect auxin transport 
in the SAM but rather regulates organ initiation after organ position- 
ing by auxin, by acting either in parallel with or downstream of auxin. 
Consistent with this hypothesis, the downregulation of several cytokinin 
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Figure 3 | AHP6 acts non-cell-autonomously downstream of auxin. 

a, pAHP6::GFP. b, pAHP6::GFP (green) and DR5:: VENUS (magenta; n = 6). 
c, d, Longitudinal optical sections at il (c) and P1 (d). e, AHP6 expression in 
mp-S319. f, Electrophoretic mobility shift assay (EMSA) using MONOPTEROS 
DNA-binding domain (MP-DBD). Cartoon shows putative ARF binding 
sites in AHP6 promoter (1.6 kb). g-k, AHP6 protein distribution. 

g, pAHP6::AHP6-GFP (n = 12; inset shows original image). h, Close-up of 
(g). i, i2/i1 AHP6-GFP ratios. j, Fluorescence distribution along areas 

shown in (h). k, pAHP6::AHP6-GFP (green) and DR5:: VENUS (magenta). 

1, pAHP6::AHP6-3X VENUS (inset: pAHP6::AHP6- VENUS). m, Boxplots of 
variance per individual of divergence angles between siliques in wild type 

(n = 19), ahp6-1 pAHP6::AHP6-VENUS (n = 23, 19 and 17, respectively), 
ahp6-1 pAHP6::AHP6-3X VENUS (n = 17, 17, 18 and 20, respectively) and 
ahp6-1 (n = 18). P values: two-sided Kruskal-Wallis. Autofluorescence visible 
in grey (a-d, 1). Scale bars, 50 um. 


signalling inhibitors can partially restore organ initiation in the auxin 
signalling deficient mp mutant”*. A plausible scenario would then be 
that lower levels of AHP6 in il compared to presumptive i2 promote il 
initiation. Inversely, higher concentration of AHP6 in i2 would repress 
cytokinin signalling and organ initiation, allowing for a time delay between 
il and i2 initiations. 

In conclusion, our results indicate that, although the spatial position 
of new organs at the SAM is robustly determined by the auxin-based 
inhibitory fields**’"”’, the dynamics of these fields leads to a noisy plas- 
tochron. Accordingly, a recent theoretical study demonstrated that noise 
induces principally irregularities of the plastochron in a phyllotactic 
model”. Our results further indicate that the noise on the plastochron 
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Figure 4 | AHP6 inhibitory fields generate patterns of cytokinin signalling 
in the meristem. a, RT-PCR of ARABIDOPSIS RESPONSE REGULATORS 
(ARRs) induction by cytokinin (100 nM BAP) in wild-type, ahp6-1 and 
35S::AHP6 (overexpression, OE). P value shows two-sided ANOVA (n = 3), 
*P = 0.05. Error bars show s.d. b, c, TCS::GFP in wild-type (b; WT: n = 10) 
and ahp6-1 (c; n = 9). d-k, DR5:: VENUS (magenta) and TCS::GFP (green) 

in wild type (d, f-h) and ahp6-1 (e, i-k). Longitudinal optical section with 
both merged and separate channels in i2, il and P1 are shown (WT: f-h; 
ahp6-1: i-k). Arrowheads indicate TCS::GFP in primordia; 1, TCS::GFP 
quantification. P values from one-sided Mann-Whitney U-test (WT: n = 5; 
ahp6-1: n = 8; *P = 0.05). Error bars represent s.d. Grey autofluorescence 
visible (d, e). Scale bars, 50 um (b-e), 10 um (f-k). 


is then filtered out in part by non-canonical inhibitory fields generated 
by AHP6 movement downstream of the primary auxin fields. In this 
scheme, plastochron robustness would be increased by conversion of 
the spatial information provided by cytokinin signalling inhibitory 
fields into a roughly periodic temporal sequence of auxin-induced orga- 
nogenesis (Supplementary Fig. 15). 


The ae fheilhiee Coban (Col-0) ecotype was used except for pWUS:: 
GFP and pLFY::GFP (Ws). TCS::GFP was generated by transformation of a previ- 
ously described plasmid”*. pAHP6::AHP6-VENUS/3X VENUS and 35S::AHP6 were 
constructed by Gateway recombination, using a 2,494-bp AHP6 genomic frag- 
ment and full-length cDNA respectively, and introduced in plants. 

RNA in situ hybridization and SEM were performed as described*"’, with minor 
modifications for the whole-mount RNA in situ hybridization. To analyse xylem 
defects, roots were cleared with chloral hydrate. Light microscopy images were 
obtained using either transmission or laser-scanning confocal microscopes. Cul- 
ture and live-imaging of SAM were performed as previously described”, except for 
adding 555 nM BAP to the culture medium. Images were processed and analysed 
using Image J (http://rsbweb.nih.gov/ij/). The PIN1 network was analysed from 
PIN1 whole-mount immunolocalization as described’* and by computing (under 
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Python) a PIN1 polarity coherence index for each cell to estimate the local coher- 
ence of PIN1 orientations. 

Real-time polymerase chain reaction (RT-PCR) was performed using the SYBR 
green reagent kit (Roche) on shoots of 7-day-old seedlings as described'’. Micro- 
array data were analysed under R using GC robust multi-array average (gcRMA) 
to estimate expression, LIMMA for statistical tests and Q-values to correct for mul- 
tiple testing. For other data, statistical analyses were done using t-tests (or ANOVA 
for multiple comparisons) only when normal distributions and homoscedasticity 
of the response variables were verified, and non-parametrical Mann-Whitney tests 
used (or Kruskal-Wallis tests for multiple comparisons) otherwise. 

EMSA was performed using 30-40-bp double-stranded oligonucleotides labelled 
with Cy5 and corresponding to the AHP6 promoter sequence with or without 
mutations of putative ARF binding sites. 

To identify permutations in the sequences of divergence angle measured on 
inflorescences, a combinatorial mixture model and a hidden Markov chain model 
were used (see Methods). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Plant material and growth conditions. The ahp6-1, ahp6-3, mp-S319, pLFY::GFP, 
pWUS::GFP, pAHP6::GFP, pPIN1::PIN1-GFP, pAHP6::AHP6-GEFP, pARR15::GEP, 
DR5::GEP and DR5::VENUS lines have been described'*"*7°***'*? and are all in 
Col-0 except pLFY::GFP and pWUS::GFP (Ws). All transgenic plants (see below) 
were generated in the Col-0 ecotype. Plants were grown in vitro on Murashige and 
Skoog (MS) medium or on soil at 20°C in short-day conditions (8h light/16h 
darkness) for 4 to 5 weeks to synchronize them before being transferred in long- 
day conditions (16h light/8h darkness). Plants showing obvious developmental 
defects owing to adverse growth conditions were systematically discarded. Plants 
for live-imaging or for the analysis of phyllotactic sequences were always grown 
together at the same place within the growth chamber and with randomized posi- 
tion within each tray in order to minimize the effect of environmental fluctuations. 
Cloning and constructs. A previously described plasmid” was introduced into 
plants by floral dipping** to generate the TCS::GFP line. To generate 35S::AHP6, an 
AHP6 full-length complimentary DNA (cDNA) (encompassing a 359-bp 5’ UTR 
upstream of the ATG and an 83-bp 3’ UTR after the stop codon) obtained by 5’ 
RACE-PCR was recombined into the Gateway binary vector dpGreenKanT under 
the control of the Cauliflower Mosaic Virus (CaMV) 35S promoter. dpGreenKanT 
was obtained by inserting a Nos terminator after the attR3 recombination site in 
Kanamycin-resistant version of the Gateway-compatible pGreen 0229 plasmid**”*. 
The pAHP6::AHP6-3X VENUS construct was generated by recombining in phase 
a 2,429-bp AHP6 genomic fragment (from 1,594 bp upstream of the ATG to the 
stop codon), a 2X VENUS without stop codon and VENUS with stop codon into a 
gateway binary vector pK7m34GW (ref. 37). The plasmids were then transformed 
into Col-0 and ahp6-1 plants, respectively, by floral dipping. 

Real-time RT-PCR and microarray data analysis. The real-time RT-PCR ana- 
lysis was designed to comply with standards of RT-PCR*™”” and performed on a 
StepOne Plus cycler (Applied Biosystems) using the SYBR green reagent kit (Roche). 
The 7-day-old plants grown in vitro were transferred for 24h on plates containing 
either mock or 100 nM BAP before extraction of mRNA specifically from the shoot 
tissues using the Spectrum Plant total RNA kit (Sigma). The primers used are listed 
in Supplementary Table 1. The TCTP gene was used as a standard (Supplementary 
Table 2) and validated using BestKeeper*®. PCR efficiency was calculated for each 
primer pairs using classical calibration dilution curve and used for the estimation 
of the ACt. Three biological replicates were tested and reactions were carried out 
using three technical replicates. The effect of the treatment was calculated as the 
difference between the ACt in the BAP-treated samples and the mock-treated 
samples (AACt). Statistical significance of the results was tested using an analysis 
of variance (ANOVA) performed with R (http://www.r-project.org). 

For microarray data analysis, expression estimates were calculated from the 

raw cell files’” obtained from ArrayExpress using GC robust multi-array average 
(gcRMA)*". Statistical testing for differential expression between the different data 
sets was performed with LIMMA™”. Correction for multiple testing was done by 
computing Q-values*’. These analyses were done under R using Bioconductor pack- 
ages (http://www.bioconductor.org). By fixing the false discovery rate (FDR) to 1, 
we identified the following number of genes differentially expressed: FILp-CLV3p: 
1379; FILp-WUSp: 1165; CLV3p-WUSp: 1357. Genes affected by protoplasting”” 
(592 genes identified) were not considered in the analysis. 
Electrophoresis mobility shift assays. The B3 DNA-binding domain of ARF5 
(residues 120-274) was cloned into the vector pETM-11 (ref. 44). The protein was 
produced in Escherichia coli strain Rosetta 2 (Novagen) and purified on nickel 
sepharose high performance media (GE Healthcare) in a buffer containing 20 mM 
Tris-HCl pH8.0 and 0.5M NaCl. For EMSA, single-stranded oligonucleotides 
were annealed to complementary oligonucleotides in annealing buffer (10 mM 
Tris-HCl pH7.5, 150 mM NaCl and 1mM EDTA). The dsDNA (4 pmol) with a 
protruding G was fluorescently labelled with Cy5-dCTP (8 pmol) (GE Healthcare) 
using 1 unit of Klenow fragment polymerase (Ozyme) in 1X Klenow buffer for 1h 
at 37 °C. Binding reactions were performed with 1.5 1M of purified B3 domain and 
25 nM of labelled dsDNA in 20 ul of binding buffer (20 mM HEPES-NaOH pH 7.9, 
50 mM KCl, Tris-HCl 100 mM pH 8.0, 1% glycerol, 56 jig fish sperm DNA (Roche)). 
Binding reactions were loaded onto native 6% polyacrylamide gels and electro- 
phoresis was conducted at 90 V for 75 min at 4 °C in0.5X TBE (45 mM Tris, 45 mM 
boric acid and 1 mM EDTA pH 8.0). Gels were further scanned on a Typhoon 9400 
scanner (excitation light 649 nm, emission filter 670 nm band-pass filter (670 BP 
30); Molecular Dynamics). 

The following oligonucleotides were used: TCA (control) 5'-ATACACGCAAT 
GTCTCCCTTTTGTCTCTTCCAC-3’; site A 5’-GCAAAGAAGCATGACATA 
CGAATGAGACAATTTCAGTTTT-3’; site A mutated 5’-GCAAAGAAGCAT 
GCCATACGAATGAGCCAATTTCAGTTTT-3’; site B 5’-GITTATATGATTAT 
AACTTGACAGACCAAATAATCATCTTA-3'; site B mutated 5’-GITATATG 
ATTATAACTTGCCCGCCCAAATAATCATCTTA-3’; site C 5’-AGCTGGTC 
TGACAGGGTACGCCGGTTGTCGGGAGGAAGAA-3’; site C mutated 5’-AG 


CTGGTCTGCCAGGGTACGCCGGTTGGCGGGAGGAAGAA-3’. The putat- 
ive ARF binding sites are highlighted in bold and the mutations are underlined. 
Cytology and cell biology. To analyse xylem defects in ahp6-1 mutants expressing 
AHP6-GFP, AHP6-VENUS and AHP6-3 X VENUS, roots of 5-day-old seedlings 
were cleared with chloral hydrate. RNA in situ hybridization on sections was 
performed as described’ using full-length probes amplified by PCR. Whole-mount 
RNA in situ hybridization was performed as described* except that, before the 
treatment with proteinase K, permeability of the tissues was increased by digesting 
cell walls for 4min at room temperature with an enzyme mix containing 0.5% 
Macerozyme R10 (Yakult Honsha), 0.5% cellulase RS (Yakult Honsha), 0.25%, 
pectolyase (Sigma), 0.75% pectinase (Serva) in water further diluted 6 times in PBS 
with 0.1% Tween20. 

For PIN1 whole-mount immunolocalization, inflorescences were first fixed in 
FAA (5% formaldehyde, 50% ethanol and 10% acetic acid) for 1 h at room tem- 
perature (as all subsequent steps, unless specified) and dehydrated by a serial 
change of 70%, 90% and 100% ethanol (10 min each). Samples were then rehy- 
drated by a serial change of 90%, 70%, 50%, 30% ethanol in microtubule-stabilising 
buffer (MTSB: 5mM EGTA, 5mM MgSO4, pH 7) plus 0.1% Triton and equili- 
brated with two washes of 10 min in MTSB. The enzyme mix used for whole- 
mount in situ RNA hybridization (see above) was diluted four times in MTSB and 
cell wall digestion was carried out for 45 min. Samples were then washed 3 times 
for 10 min in MTSB plus 0.1% Triton. Before antibody application, samples were 
pre-treated for 1 h in MTSB, 10% DMSO, 3% NP-40 then 1 hin solution A (MTSB, 
0.1% Triton, 3% BSA from Sigma). Samples were then incubated overnight at 4 °C 
with the primary antibody (Ap20 anti-PIN1 antibody (Santa Cruz), dilution 1:100 
in solution A), before being washed 4 times for 10 min in MTSB plus 0.1% Triton. 
Samples were then incubated with the secondary antibody (Alexa 488 antibody 
from Invitrogen, dilution 1:500 in solution A) for 3 h at 37 °C. The excess of anti- 
body was then removed by 5 washes of 10 min in MTSB plus 0.1% Triton followed 
by 5 washes in water (10 min each). Samples were then observed using confocal 
microscopy. 

Microscopy and live imaging. SEM was performed as previously described’. 
Cleared roots and RNA in situ hybridizations were observed with a transmission 
microscope under brightfield or differential interference contrast (DIC) illumina- 
tion (Axio Imager 2, Zeiss). Confocal microscope observations were done either on 
a LSM-510 laser-scanning confocal microscope (Zeiss), a confocal spinning disc 
DMI400 microscope (Leica), a SP5 spectral detection confocal microscope (Leica) 
or ona LSM-710 spectral laser-scanning confocal microscope (Zeiss). Images were 
processed using ImageJ (http://rsbweb.nih.gov/ij/). Serial sections were used to 
count the number of cells expressing pLFY::GFP in the younger primordia. Culture 
and imaging of living SAMs expressing DR5:: VENUS was performed as described 
previously*® except for adding 555nM N6-Benzyladenine (BAP; Duchefa) in the 
culture medium. The meristems were allowed to recover for 12h before starting 
imaging every 12h for 72 h. New primordia were scored as co-initiated when they 
exhibited similar number of VENUS-positive nuclei at a new time point. 
Measurements and image analysis. All measurements and image analyses were 
done using ImageJ (http://rsbweb.nih.gov/ij/). Meristem width was measured on 
maximum-intensity projections of confocal serial images of living meristems after 
staining with FM4-64 (ref. 46). For each meristem, the youngest primordium P(n) 
separated from the meristem by a clear crease was selected. Then, the width was 
defined as the distance from this boundary to the opposite side of the meristem, 
located between P(n—1) and P(n—4) (See Supplementary Fig. 6a, b). 

For measuring the size of the expression domain of GFP markers (p WUS::GFP, 
DR35::VENUS) in the meristem, images for a given marker were thresholded using 
the same fluorescence intensity range and transformed into binary images. The 
measure of the area of the central DR5::GFP negative region (see Supplementary 
Fig. 6i, j) was estimated by fitting the largest possible disc between existing prim- 
ordia expressing DR5::GFP. 

The minimal distance of organ initiation to the meristem centre and divergence 
angle at organ apparition was established using the coordinates of the meristem 
centre and coordinates of the centre of the initium as soon as the first DR5:: VENUS- 
positive nuclei appeared during the kinetics. Organ displacement was deduced from 
the evolution of the organ-to-meristem centre distance during the time-lapse 
imaging. In this analysis, the centre of the meristem was defined as the minimal 
variation centre of the phyllotactic pattern as described’. To obtain the coordi- 
nates of the centre in a given image, we either used the FindCenter program (http:// 
www.math.smith.edu/phyllo//Research/findcenter/findcenter.html) or we deter- 
mined it manually by finding the optimal position giving divergence angles between 
all organs as close as possible to 137.5°. Both methods gave similar results, but we 
found that the manual determination was generally more robust. All distances were 
calculated on projections of serial confocal sections without taking into account the 
z coordinates. 


©2014 Macmillan Publishers Limited. All rights reserved 


The geometric determination of il and i2 positions on meristems was done first 
by determining the centre of the meristem as described above on a projection of 
serial confocal sections. Then the centres of all primordia from P4 till the oldest 
available (typically P8 to P14) were determined. The distance to the centre was 
then computed for all primordia in order to calculate the plastochron ratios (PR, 
which is the ratio of the distance to the centre of two consecutive primordia—older 
divided by younger). A mean PR was calculated for each plant. Then, positions of 
P3 to i2 were successively calculated at a 137.5° divergence angle from the previous 
organ (following the handedness of the spiral for the plant analysed) and using the 
mean PR of the plant to set the distance to the meristem centre. All calculations 
were performed using GeoGebra (http://www.geogebra.org/cms/fr/). Note that 
the fit of calculated positions of P3-P1 primordia with their actual positions pro- 
vided an internal quality control for this geometrical modelling. In addition, the 
high predictive capacity of this method for positioning il was further demon- 
strated using 16 DR5::VENUS meristems (Supplementary Fig. 11). 

For quantification of fluorescence intensities over a region (TCS::GFP or 

AHP6-GFP), we used summation-intensity projections in the region of interest 
and calculated the raw integrated density of the fluorescence. For TCS::GFP, we re- 
sliced the confocal stacks to obtain longitudinal slices of the region corresponding 
i2, il and P1. Fluorescence levels were then measured only from cells in L1 and L2. 
AHP6-GFP fluorescence levels at the predicted il and i2 sites (see above) were also 
calculated only from L1 and L2 cells over the area of a circle of 100 jum” centred on 
the predicted position of il and i2 (this corresponds approximately to the size of il 
when detected with DR5::VENUS). Profiles of AHP6 distribution were obtained 
using the ‘Plot Profile’ function of ImageJ along the regions indicated on Fig. 3h 
and Supplementary Fig. 12. 
PIN] transport network analysis. To map PIN1 in the meristems using immu- 
nolocalization images, cells were segmented using the Merryproj and Merrysim 
softwares** on the projection of a confocal stack. Influence zones were analysed as 
described’. STSE software was used to process segmentations and PIN1 orienta- 
tions in order to obtain influence zones, and STSE and PlantGL were used to gene- 
rate the colour maps*”*”. 

To calculate the PIN] polarity coherence index for a given cell, we computed all 

unit vectors pointing from the centre of this cell to the neighbouring cells with a 
side facing PIN1 proteins. We averaged all these vectors, and normalized the 
average, so as to obtain a unit vector defining the cell PIN1 polarity, which is 
representative of the direction in which auxin is transported in the cell of interest. 
To analyse the coherence of auxin transport, we defined for each cell the average of 
the polarity vectors of all neighbouring cells (including the cell of interest). The 
coherence index is the norm of this final average polarity vector. The coherence 
index has the value 1 when all cells have the same polarity, and a low value if pola- 
rities are very different. In order to generate a control, we replaced the measured 
PIN] distribution by a distribution in which PIN1 proteins in a given cell are real- 
located randomly to the other sides of each cell (and keeping the same number of 
sides carrying PINI so that connectivity remains the same). We recomputed the 
coherence index for this ‘random’ PIN] distribution. Scenarios and index calcula- 
tions were implemented using Python. 
Measures of phyllotactic sequences. Measures of the sequences of silique diver- 
gence angles was performed as described*'. For each phyllotactic measurement, 
several plants of the different genotype tested were grown in parallel (always inclu- 
ding Col-0 and ahp6-1 control individuals). For each individual of each genotype, 
the variance of the divergence angles was computed and individual variances of 
divergence angles were compared between genotype using a non-parametric Kruskal- 
Wallis test under R, as their distributions were not normally distributed. 

Analyses of ahp6-1 pAHP6::AHP6-VENUS and ahp6-1 pAHP6::AHP6-3X VENUS 
were performed on T2 transformants which were hemizygote or homozygote for 
the transgene. 

Models used for characterization of permutation patterns. To investigate the 
presence of particular motifs in phyllotactic sequences, we pooled four independ- 
ent experiments of measurements with reproducible results, providing a data set of 
82 wild-type and 89 ahp6-1 plants. An exploratory analysis highlighted two char- 
acteristics of the divergence angle sequences: (1) the existence of short segments of 
non-canonical divergence angles along measured sequences (Fig. 1h, i, Supplemen- 
tary Fig. 4c—f); and (2) almost all the possible angle values between 0 and 360° were 
observed with highest frequencies around the canonical angle « (Supplementary 
Fig. 4a). At least four classes of divergence angles were apparent but they were not 
unambiguously separated. To test if the segments of non-canonical angles could 
be explained by permutations and given the noisy character of the measurements, 
we designed a stepwise modelling approach***? with two objectives: (1) to identify 
permutation patterns; and (2) optimally label the measured divergence angle sequences. 

Ina first step, a stationary hidden first-order Markov chain was estimated on the 
basis of the pooled measured divergence angle sequences (171 sequences repre- 
senting a cumulative length of 5,220 angles). In this hidden first-order Markov 
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chain, the states of the non-observable Markov chain represents ‘theoretical’ 
divergence angles whereas von Mises observation distributions attached to each 
state of the non-observable Markov chain represents measurement uncertainty. 
The von Mises distribution” is a univariate Gaussian-like periodic distribution for 
a variable xe[0,360°). The von Mises observation distributions estimated for the 
five states of the non-observable Markov chain were centred on the multiples of 
the canonical divergence angle «, 20, —a, 30%, —20 (see Supplementary Table 2). 
The permutation of 2 consecutive organs generates the divergence angles 24, —a 
and 3a. The identification of —2c using this five-state model suggested the occur- 
rence of permutations involving 3 organs in the measured sequences. Ifin addition 
to 2-permutations, 3-permutations are considered, the divergence angles — 20, 40 
and 5a are expected to be observed. As the standard deviations of these von Mises 
observation distributions were quite similar, particularly for the most represented 
states corresponding to o, 20 and —«, we chose to estimate a five-state hidden first- 
order Markov chain in which the von Mises observation distributions share the 
same concentration parameter (inverse variance). The optimally labelled diver- 
gence angle sequence (that is, discrete sequence with five possible values chosen 
among 0, 20, —%, 3%, —2«) was then computed for each observed sequence using 
the estimated hidden first-order Markov chain. 

Ina second step, the memories of a variable-order Markov chain were optimally 
selected** on the basis of these labelled divergence angle sequences. This can be 
interpreted as a way to identify local dependencies between successive divergence 
angles. For the selection of these memories, we chose to discard the individuals that 
were very poorly explained by the estimated hidden first-order Markov chain (10 
individuals out of 171 whose posterior probability of the optimally labelled diver- 
gence angle sequence <0.13). The variable-order Markov chain was a mixed first-/ 
second-order Markov chain where the first-order memory 2« was replaced by the 
four second-order memories 020, 2020, —«2a, —20%2% (the memory 302% was 
not observed) with respect to a simple first-order Markov chain. This means that 
to predict accurately the most frequent permutation patterns, it is only necessary 
to take into account the divergence angle that precedes 2a. This is illustrated by the 
building of the 2-permutation pattern [20 ~ e201] as a succession of memories 
with high transition probabilities: «2% > —a > —a2a (Supplementary Table 3) 
instead of 2° —o"> 2a witha simple first-order Markov chain. Finally, a hidden 
variable-order Markov chain was estimated where the underlying variable-order 
Markov chain has the memories previously selected. 

One advantage of hidden Markov models is the capability to compute an abso- 
lute measure of the relevance of the optimally labelled divergence angle sequence 
as a posterior probability (that is, weight of this optimally labelled divergence angle 
sequence among all the possible labelled divergence angle sequences that can explain 
a given observed sequence). One shortcoming of hidden Markov models is that 
some multiples of the canonical divergence angle that occur rarely (for example, 
401, 50) as well as alternative phyllotaxis (for example, Lucas with a canonical diver- 
gence angle of 99.5°) cannot be modelled. To be able to further investigate these 
sequences, we used a combinatorial mixture model as described”. The final results 
are then a consensus deduced from the divergence angle sequence optimally labelled 
by the hidden variable-order Markov chain and the combinatorial mixture model 
(Fig. 1k). 
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Chasing acyl carrier protein through a catalytic cycle 


of lipid A production 


Ali Masoudi', Christian R. H. Raetz't, Pei Zhou! & Charles W. Pemble IV?" 


Acyl carrier protein represents one of the most highly conserved 
proteins across all domains of life and is nature’s way of transport- 
ing hydrocarbon chains in vivo. Notably, type II acyl carrier proteins 
serve as a crucial interaction hub in primary cellular metabolism’ by 
communicating transiently between partner enzymes of the numer- 
ous biosynthetic pathways”*. However, the highly transient nature 
of such interactions and the inherent conformational mobility of 
acyl carrier protein’ have stymied previous attempts to visualize 
structurally acyl carrier protein tied to an overall catalytic cycle. 
This is essential to understanding a fundamental aspect of cellular 
metabolism leading to compounds that are not only useful to the 
cell, but also of therapeutic value. For example, acyl carrier protein 
is central to the biosynthesis of the lipid A (endotoxin) component 
of lipopolysaccharides in Gram-negative microorganisms, which is 
required for their growth and survival*’, and is an activator of the 
mammalian host’s immune system®”’, thus emerging as an import- 
ant therapeutic target* '°. During lipid A synthesis (Raetz pathway), 
acyl carrier protein shuttles acyl intermediates linked to its pros- 
thetic 4’-phosphopantetheine group” among four acyltransferases, 
including LpxD"™’. Here we report the crystal structures of three 
forms of Escherichia coli acyl carrier protein engaging LpxD, which 
represent stalled substrate and liberated products along the reaction 
coordinate. The structures show the intricate interactions at the 
interface that optimally position acyl carrier protein for acyl delivery 
and that directly involve the pantetheinyl group. Conformational 
differences among the stalled acyl carrier proteins provide the 
molecular basis for the association—dissociation process. An unan- 
ticipated conformational shift of 4’-phosphopantetheine groups 
within the LpxD catalytic chamber shows an unprecedented role 
of acyl carrier protein in product release. 

Although the paradigm for acyl carrier protein (ACP) association 
with protein partners is thought to be exceedingly transient, the LpxD 
acyltransferase in the Raetz pathway (Supplementary Fig. 1a) binds 
ACP with very high affinity (Kqa=59nM)". LpxD transfers R-3- 
hydroxymyristoyl (B-OH-C14) acyl chains that are delivered by ACP 
to the amino group of uridine diphosphate 3-O-(B-OH-C14)-c-p- 
glucosamine (UDP-acyl-GlcN)” (Supplementary Fig. 1b). Moreover, 
LpxD follows an ordered sequential kinetic mechanism in which 
acyl-ACP binds first and, importantly, holo-ACP (acyl chain has been 
liberated) dissociates last'*. Given the canonical mode of ACP-mediated 
transfer often allows the rapid exchange between protein partners”, 
such pronounced association between ACP and LpxD is most unusual 
and instead suggests a ‘strong, transient’ interaction that would require 
a yet to be identified ‘molecular trigger’ for dissociation. Thus, we 
reasoned that the crystallographic study of the ACP-LpxD complex 
offers a unique opportunity to gain the detailed molecular basis for 
ACP-based acyl delivery and a deeper understanding of protein-protein 
communication more generally. 

We present here three X-ray co-crystal structures of ACP bound to 
LpxD, each of which captured a different form of the carrier protein: 


intact-acyl-ACP, hydrolysed-acyl-ACP and holo-ACP (Fig. la, b, Sup- 
plementary Fig. 2 and Supplementary Table 1). These structures were 
resolved to 2.1, 2.9 and 2.15 A resolution, respectively (Supplementary 
Table 2). In each case, the triclinic unit cell contained two LpxD 
trimers with differing bound states of ACP (Supplementary Fig. 3). 
The overall B values for all ACPs modelled in each structure are two- to 
threefold higher than LpxD (Supplementary Fig. 4 and Supplementary 
Table 2) and, accordingly, the observed electron density was weaker in 
some regions (Supplementary Fig. 3). Nonetheless, the placement of 
side-chains at the protein-protein interface, prosthetic groups, and 
acyl chain(s) were apparent (Supplementary Fig. 2d-fand Supplemen- 
tary Fig. 5). Our structures show the 4'-phosphopantetheine group 
(4’-PPT) (attached to Ser 36) and its B-OH-C14 acyl chains having 
vacated the canonical hydrophobic cavity extending through the core 
of ACP*'*”—all of which require considerable movement (Supplemen- 
tary Fig. 6). The overall architecture of the LpxD trimer is similar to 
previously reported X-ray structures'*”” in that each monomer of LpxD 
can be subdivided into three domains (Fig. 1c and Supplementary Fig. 2): 
the amino (N)-terminal uridine-binding domain, which is tethered to 
the left-handed f-helix domain that harbours the conserved catalytic 
His 239 residue’*, and a carboxy (C)-terminal domain. 

The structures show three molecules of the carrier protein are loca- 
lized to the C-terminal end of LpxD (Fig. 1b). Notably, we have iden- 
tified the ACP recognition domain (ARD) (Fig. 1c), which is formed by 
the C-terminal domain and the last beta-coil of the left-handed B-helix 
domain, providing the molecular basis for ACP association. This con- 
trasts with a previous study that suggested the uridine-binding domain 
as the likely ACP docking site owing to its proximity to the catalytic 
cleft'®. Although the analogous C-terminal region of the LpxA acyl- 
transferase” is found to adopt a completely different orientation from 
that of LpxD, it may serve a similar function in binding ACP (Sup- 
plementary Fig. 7). 

By virtue of the complete engagement of ACP, three competent 
active sites are created (Fig. 2a). Each ACP-LpxD interface buries a 
surface area of approximately 530 A’ and is predominated by comple- 
mentary electrostatic interactions (Fig. 2b). In addition, van der Waals 
contacts and extensive interaction with the prosthetic group contri- 
bute to the large binding footprint that explains the ‘strong, transient’ 
nature of these two protein partners. A combination of residues 
located on the ‘universal recognition helix’ (helix II)”’, as well as por- 
tions of L1, L2 and helix-II of ACP, provide the acidic surface that 
binds a pronounced basic patch on LpxD. This surface feature of ACP 
can be subdivided into two highly acidic regions, I and II, which 
include residues Glu 30-Met 44 and Ala 45-Glu 60, respectively. The 
complementary binding surface on LpxD involves residues from all 
three monomers (denoted by prime symbols) and forms a shallow 
groove between coiled coils of the ARD into which helix-II packs 
(Supplementary Fig. 8). 

Within region I, Asp 35, Ser 36, Leu37, Asp 38, Val 40, Glu41 and 
Met 44 are important for binding the N-terminal end of the recognition 
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helix to the base of the ARD domain (Supplementary Fig. 8), and the 
interactions were notably present in all three structures. Region II of 
ACP interacts with the upper portion of the ARD domain, the details of 
which differ substantially among the three stalled ACP complexes 
(discussed below). Most of the residues within regions I and II are 
conserved among other type II carrier proteins (Supplementary Fig. 2b) 
and have been implicated as key modulators of ACP association””. 
The most universal electrostatic interaction shown across the intact-, 
hydrolysed- and holo-ACP structures is between Arg 293 of LpxD and 
Asp 35, Asp 38 and Glu 41 of ACP that flank Ser 36. This coordination 
point is crucial for ACP association with LpxD as mutation of Arg 293 
to alanine results in a 23-fold increase in Michaelis constant (K,,) for 
acyl-ACP compared with wild-type LpxD, with little effect on catalyst 
rate constant (k.a)'”. These interactions orient ACP by positioning the 
pantetheinylated Ser 36 residue towards the catalytic chamber of LpxD. 
In addition to the protein-protein interactions, the prosthetic group 
and acyl chain of intact-acyl-ACP extensively contact the surface of the 
partner enzyme (Supplementary Fig. 9). Inspection of the electron 
density indicates that the thioester scissile bond remained unbroken, 
which required a His239Ala LpxD variant (Supplementary Fig. 5a). 
The B-OH-C14 acyl chain and the 4’-PPT arm adopt a horseshoe-like 
conformation, which in effect buries the acyl chain between the pros- 
thetic group and a pronounced hydrophobic channel (N-channel) 
formed between LpxD monomers (Fig. 2c). The 4’-phosphate moiety 
of 4’-PPT is directly coordinated by Asn 310 and Arg 314 of LpxD and 
may partly contribute to some level of specificity. The remainder of 
the 4’-PPT arm adopts a rather extended conformation, stretching 
over 14 A, and interacts with residues primarily along the rim of the 
N-channel. This conformation places the thioester bond in proximity to 
the alanine-substituted His 239 catalytic base and orients the carbonyl- 
oxygen of B-OH-C14 towards the amide nitrogen atom of Gly 257”, 
corroborating its role in forming the oxyanion hole’*”*. 
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Figure 1 | Stalled ACPs bound to 
LpxD. a, The three forms of ACP: 
intact-acyl-ACP (green), 
hydrolysed-acyl-ACP (salmon) and 
holo-ACP (red). ACP adopts a four- 
helix bundle (I-IV) with several loop 
regions (L1-L3)**”’. The 4'-PPT arm 
and acyl chains are shown as stick 
models and coloured orange or dark 
grey, respectively, and by atom. 

b, Overall architecture of the ACP- 
6 LpxD complex (intact-acyl-ACP 
shown). Three ACPs bind the 
C-terminal end of the LpxD trimer 
(coloured by chain purple/magenta/ 
light pink). c, Cartoon rendering 
showing the overall fold and 
interaction of acyl-ACP with LpxD. 
Highlighted dark purple is a single 
monomer of LpxD with subdomains 
indicated: uridine-binding domain 
(UBD), left-handed B-helix domain 
(LBH) and C-terminal domain 
(CTD). ACP, interfacing with the 
ARD, and its acyl-4'-PPT group are 
shown. The locations of the catalytic 
base (His 239), oxyanion hole 

(Gly 257), and molecular 
‘hydrocarbon ruler’ (Met 290) are 
indicated within the ten B-helical 
coils (orange strands) of a single left- 
handed f-helix domain. 


Holo-ACP 


Intact-acyl-ACP 


Two features of LpxD specificity towards B-hydroxyacyl chains are 
explained by the intact-acyl-ACP structure. First, the terminal two 
carbon atoms of B-OH-C14 pack against Met 290’’ located at the far 
end of the N-channel (Fig. 2c and Supplementary Fig. 9). Given that 
LpxD is highly specific for 14-carbon acyl-chain-lengths", this sup- 
ports the role of Met 290 as a ‘hydrocarbon ruler’. Second, LpxD is 
particularly selective for the B-hydroxyl group of the acyl chain; its 
removal completely abrogates acyltransferase activity”’. Our structures 
show an intricate hydrogen bond network between the B-hydroxyl 
group and Asp 232’, Gln 236’ and a critical water-mediated bridge 
with Asp 216’’ and the main-chain nitrogen atoms of Phe 183’’ and 
His239Ala’’. 

By contrast to the intact-acyl-ACP complex, the thioester bond is 
broken in the hydrolysed-acyl-ACP structure (Fig. 2d, Supplementary 
Fig. 5b and Supplementary Fig. 9). Notably, we discovered a second 
molecule of B-OH-C14 fatty acid bound to the LpxD surface (Sup- 
plementary Fig. 10), which shows an extra hydrophobic channel 
(O-channel) that probably binds the acyl chain that is ester-linked to 
the UDP-acyl-GIcN lipid substrate (Supplementary Fig. 1b). A super- 
position with the previously reported Chlamydia trachomatis LpxD in 
complex with UDP-GIcNAc’® illustrates the proximity of the carbox- 
ylate head group of the extra fatty acid to the anticipated binding locale 
of the 3-hydroxyl position of the GIcN ring (Supplementary Fig. 11). 

What is striking about both the intact- and hydrolysed-acyl-ACP 
structures is the conformational similarity between the 4’-PPT pros- 
thetic groups that are fully extended (Fig. 2c, d), whereas the holo- ACP 
structure shows an alternative conformation of its prosthetic group 
packing against the far end of the N-channel (Fig. 2e and Supplemen- 
tary Fig. 9). Notably, in both intact- and hydrolysed-acyl-ACP the 
pantetheinyl arms completely enclose the reaction chambers (Fig. 3a). 
Although this architecture probably stabilizes substrate binding within 
the hydrophobic N-channel, it raises a key question. Given a sequential 
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ACP 


LpxD 
threefold axis 


N-channel 


Figure 2 | Intermolecular interactions between ACP and LpxD. 

a, b, Overview of the protein-protein interactions (intact-acyl-ACP depicted). 
a, Top-down view of the ACP-LpxD complex showing three molecules of ACP 
bind per LpxD trimer. b, Electrostatic surface representation of the ARD and 
ACP (inset); the potential contours were scaled to +79.2 (blue) and —79.2 (red) 
kgTe * (where kg is the Boltzmann constant, T is temperature and e is the 
charge of an electron). c-e, Detailed interactions between the LpxD reaction 
chamber and the bound acyl/4’-PPT groups of ACP. Hydrogen bonds are 
shown as black dashes. Molecular surfaces are of only those residues that 


ordered reaction mechanism, how does the di-acylated-GIcN product 
leave LpxD before holo-ACP if the reaction chamber is completely 
blocked? A structural comparison between intact- and holo-ACP 
sheds light on this matter by exposing a substantial movement that 
the 4’-PPT arm has undergone (Fig. 3b). Notably, the terminal thiol of 
the 4’-PPT has vacated the catalytic cleft and moves approximately 
15A to be situated near Met 290. A sizeable region of the reaction 
chamber closest to the catalytic cleft is now open to solvent, thereby 
giving the diacyl-GlcN product an opportunity to dissociate before the 
release of holo-ACP. 

The remarkable difference in conformation shown by the 4'-PPT 
arm, together with the ordered sequential mechanism of acyl transfer, 
prompted us to propose that this ‘swing’ motion may in fact be involved 
in ‘triggering’ the release of lipid product. Because every terminal thiol 
of holo-ACP was positioned approximately 3.7 A away from Met 290 
of LpxD, we considered the possibility of mutating this residue to a 
cysteine in an effort to induce a mixed-disulphide linkage post-catalysis. 
Biochemically, the Met 290Cys mutation abrogates acyl transfer to 
UDP-acyl-GlcN compared with wild-type LpxD (Supplementary Fig. 12). 
This suggests that a covalent bond is formed between the cysteinyl and 
4'-PPT thiols. Thus, we reasoned that the addition of reducing agent 
would rescue acyl transfer. Accordingly, titration of dithiothreitol (DTT) 
into the reaction mixture recovers activity of Met290Cys-LpxD to 
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N-channel fe 
(upper portion) 


contribute to interactions. c, Intact-acyl-ACP complex. The B-OH-C14 acyl 
chain delivered by ACP binds the hydrophobic N-channel and its terminal 
carbon atoms pack near Met 290’’. d, Hydrolysed-acyl-ACP complex. An 
equivalent B-OH-C14 acyl chain is shown bound to a newly identified 
hydrophobic channel (O-channel). e, Holo-ACP complex. The 4'-PPT arm 
interacts at the far end of the N-channel, positioning the terminal thiol near 
Met 290’’. A ‘ghost outline’ of the catalytic cleft is indicated by a grey dotted 
line. 


levels indistinguishable from that of wild-type enzyme (Supplemen- 
tary Fig. 12). These data suggest that the observed 4’-PPT motion has 
functional relevance in the course of product release. Moreover, as 
typically exemplified with ‘strong, transient’ interactions”, this sub- 
stantial movement of the pantetheine arm probably serves as the ‘molecu- 
lar trigger’ that promotes the collapse of the ACP-partner complex. 
Because our structures show different states of ACP stalled at the 
LpxD active site, an alignment of LpxD domains allows us to visualize 
the movements within ACP as it relates to the overall catalytic cycle 
(Fig. 3c), which to the best of our knowledge is unprecedented. Helix-I, 
helix-II and portions of L1 remain relatively unchanged (root mean 
squared deviation of approximately 1 A) and are preserved for the 
purpose of remaining docked. The largest differences occur down- 
stream of the recognition helix (Fig. 3c), including helices III, IV, L2 
and L3 (root mean squared deviation of approximately 3 A) and are an 
indication of what interactions must break for dissociation. A closer 
inspection shows that the intact-complex makes electrostatic interac- 
tions with the entire ARD interface that involves region II, whereas 
both the hydrolysed- and holo-ACP structures do not (Fig. 3d). These 
data indicate that the more extensive interactions are important for 
molecular recognition during association; however, after acyl transfer 
it is conformational changes in ACP that ultimately destabilize the 
protein-protein complex. In this context, one particular molecule of 
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© intact-4’-PPT 
@ Holo-4’-PPT 


Chamber 


ACP 


holo-ACP within the asymmetric unit is displaced from the others and 
possibly represents a different binding state showing extra interactions 
that must break for dissociation (Supplementary Fig. 13). 

The structures captured in this study begin to establish key molecu- 
lar movements within ACP that initiate molecular recognition and a 
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Figure 4 | Molecular basis for the ordered-sequential reaction mechanism 
and involvement of ACP in lipid-product release. a, b, Acyl-ACP (a) binds 
first to free LpxD forming the binary complex (b). ACP associates with the ARD 
and the acyl-4’-PPT packs into the hydrophobic N-channel. c, UDP-acyl-GlcN 
binds next, which initiates acyl transfer. d, In the ternary product complex 
the 4'-PPT arm of hydrolysed-acyl-ACP (transparent orange squiggle) 


Pivot 
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Figure 3 | ACP conformations and 

reorganization of its prosthetic 
} group. a, Top-down view of LpxD 
(intact-acyl-ACP depicted) showing 
three reaction chambers enclosed by 
the 4’-PPT. b, Structural comparison 
between the intact- (green) and holo- 
4'-PPT (red) prosthetic groups. The 
4'-PPT rearrangement is indicated 
(grey dotted arrow). c, Difference 


f a distance matrix calculated between 


fully modelled intact- (chain K) and 
holo-ACP (chain G). Deviations 
between like atoms are shown as a 
putty-sausage representation. Both 
the thickness and heat-map 
colouring indicate regions of least 
(thin, blue) to highest (thick, red) 
displacement. d, Schematic 
summarizing differences in 
electrostatic interactions between 
ACP complexes. Residues are 
indicated as red (acidic) or blue 
(basic) circles and subdivided 
according to region I or II affiliation. 
Interactions made by intact-acyl- 
ACP alone are depicted (dotted lines) 
and those common among all 
complexes are represented with a 
star. The grey zigzag indicates those 
electrostatic interactions that are 
broken in the hydrolysed- and holo- 
ACP product complexes. 


ARD 


mechanism with which associations can be ultimately broken. In this 
context, we present a model for ACP-based synthesis of lipid A precursors 
catalysed by LpxD (Fig. 4) and can begin to extract some general 
principles for these types of interaction. First, although there does 
not yet seem to be a consensus-sequence binding motif on the surface 
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completely encloses the reaction chamber, blocking UDP-diacyl-GlcN from 
leaving. By moving the 4'-PPT (dark orange squiggle) towards Met 290 (dotted 
arrow), the catalytic chamber opens up. e, f, This motion drives the eventual 
release of UDP-diacyl-GIcN (e) and ‘triggers’ conformational changes 
downstream of helix-II leading to holo-ACP dissociation (f). 
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of partner proteins, both regions I and II of ACP seem to be frequently 
involved, with most of the small number of other ACP-based com- 
plexes determined showing that helices II, -III, L1 and L2 are consis- 
tently used”””*?°*”, Second, because these regions are broadly universal, 
exploiting specific residue contacts at the interface and fine-tuning the 
buried surface area at these regions probably dictate how ACP-based 
associations can be both specific and temporary. For instance, our 
structures show the first direct, electrostatic coordination with the 
4'-phosphate of both acyl- and holo-ACP, which in addition to extens- 
ive protein-protein contacts provide the ‘strong, transient’ nature of 
the ACP-LpxD complex. Third, because we observe ACP tied to an 
entire catalytic cycle, a substantial conformational change downstream 
of the recognition helix-II may represent a more general communica- 
tion mechanism for breaking ACP-partner complexes. Finally, the 
stalled ACPs captured herein have shown an unprecedented role for 
carrier proteins in product release, and the contribution of the pan- 
tetheinyl group in both the formation and dissociation of the ACP- 
partner complex. Perhaps motion of the pantetheinyl group in other 
ACP-based complexes provokes dissociation in a similar manner, 
especially when considering other ‘strong, transient’ protein partners. 


METHODS SUMMARY 


Key to capturing ACP bound to LpxD at different stages of catalysis was the ability 
to preload E. coli ACP with its 4'-PPT prosthetic group by co-expression with 
holo-ACP synthase (ACPS), and subsequently charging holo-ACP with R-3- 
hydroxymyristic acid enzymatically using the soluble form of acyl-ACP synthetase 
from Vibrio harveyi. Holo-ACP was purified by nickel affinity, anion exchange, 
covalent chromatography, which exploits disulphide bond formation between the 
4'-PPT group and ThioPropyl Sepharose resin (Sigma-Aldrich), and finally size- 
exclusion chromatography. Complete conversion of holo-ACP to its acylated form 
was confirmed by mass spectrometry and 2.5M urea gel electrophoresis. The 
resulting acyl-ACP mixture was purified to homogeneity using size-exclusion 
chromatography. E. coli LpxD was overexpressed, and purified as previously 
reported”. An active site mutant of LpxD was also necessary to preserve the 
thioester bond of acyl-ACP and was generated by introducing an alanine substi- 
tution for His 239. Crystals of the complexes were grown by vapour diffusion and 
incubating either holo- or acyl-ACP with LpxD in a 1:1 (mol:mol) stoichiometry. 
Diffraction data were collected on the SER-CAT 22-BM and 22-ID beamlines at 
the Advanced Photon Source at Argonne National Laboratory. The structures 
were solved by the molecular replacement method using the published E. coli 
LpxD structure (Protein Data Bank 3EH0)’’ as the search model. E. coli ACP 
was intentionally omitted during molecular replacement and, instead, was manu- 
ally rebuilt into unbiased difference electron density maps. The rate of LpxD 
catalysed conversion of [o-?*P]JUDP-3-acylGIcN to [a-**P]UDP-2,3-diacylGlcN 
was measured as described previously using thin-layer chromatography”. Details 
of all procedures are presented in Supplementary Information. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Expression and purification of Hisg-LpxD. E. coli LpxD was overexpressed, 
and purified as previously reported’. Briefly, Hiss-LpxD was expressed in E. coli 
Rosetta/pLysS. The membrane-free fraction was loaded onto a 5ml Ni-NTA 
(Qiagen) column and eluted in one step with 200 mM imidazole. The His¢-tag 
was left intact and the resulting LpxD was concentrated to approximately 4-10 ml 
and loaded onto a High Load 26/60 Superdex 200 gel filtration column (GE) 
equilibrated with 10 mM Tris-HCl, pH 7.5, 500mM NaCl and 1 mM DTT. The 
protein peak had an elution profile consistent with that of the LpxD homotrimer. 
Fractions were pooled and concentrated to approximately 40 mg ml and stored 
in aliquots at —80°C. The wild-type LpxD enzyme used in the LpxD assay was 
purified and stored in the absence of DTT. 
Construction, expression and purification of Hisg-LpxD point mutants. Site- 
directed mutants designed to alter Met 290 (Met290Cys), and His 239 (His239Ala) 
were accomplished using the QuikChange PCR mutagenesis kit (Stratagene). The 
LpxD variants were recombinantly expressed in C41(DE3) E. coli cells and purified 
in the same manner as described for wild-type Hisg-LpxD. The His239Ala LpxD 
was stored in 10mM Tris-HCl, pH 7.5, 500mM NaCl and 1mM TCEP, and 
Met290Cys LpxD was purified and stored in the absence of a reductant. 
In vitro assay of LpxD. The LpxD catalysed conversion of [a-**P]UDP-3-acylGIcN 
to [a-*’P] UDP-2,3-diacylGlcN was monitored by thin-layer chromatography as 
previously reported’>. The 20 il assay mixture containing 40 mM HEPES (pH 7.4), 
0.02 mg ml! BSA, 1.4nM pure wild-type E. coli LpxD (or 4.2 nM Met290Cys LpxD), 
6 iM R-3-hydroxymyristoyl-ACP (B-OH-C14-ACP) and 4 1M [«-?*P]UDP-3-O- 
(B-OH-C14)-GIcN (0.005-0.04 [Ci ul?) was equilibrated at 30 °C, and the reac- 
tion was initiated by the addition of the enzyme. DTT (100 mM) was added to 
assess its effect on the catalysis of the wild-type and mutant LpxD. A 1 pil aliquot of 
the reaction mixture was spotted onto a silica gel 60 plate (EMD Chemicals) at 
various time-points. After drying under a cold air stream, plates were developed 
with a chloroform/methanol/water/acetic acid solvent (25:15:4:2, v:v:v:v). The plates 
were dried and exposed overnight to a Molecular Dynamics PhosphorImager 
Screen. The conversion rate of the lipid substrate to product was measured using 
ImageQuant software. 
Expression and purification of holo-ACP. The acp gene was amplified from 
E. coli W3110A genomic DNA using the ACP-forward and ACP-reverse primers 
(Supplementary Table 1) engineered to introduce downstream of the ACP coding 
region a spacer sequence, a ribosome binding site, followed by a translation spacer 
element (denoted PCR product A). The ACPS-forward and ACPS-reverse primers 
(Supplementary Table 1) were used to amplify the acps gene from W3110A geno- 
mic DNA, which contained a spacer sequence, a ribosome binding site and a 
translation spacer element upstream of the ACPS coding region (denoted PCR 
product B). The ACP-forward and ACPS-reverse primers were subsequently used 
to generate PCR product C, which contained acp and acps genes separated by the 
spacer sequence, ribosome binding site and the translation spacer element. PCR 
product C was cloned into a pET16b* vector, which contained an engineered 
tobacco etch virus (TEV) protease cleavage site instead of Factor Xa, using the 
NdeI and Xhol restriction sites. E. coli DH5«-competent cells were subsequently 
transformed with plasmid and transformants were selected at 37 °C on a LB-agar 
plate supplemented with 100,1gml~' ampicillin. The sequence of the plasmid 
encoding the N-terminally histidine-tagged ACP and non-tagged ACPS (pET16b*- 
AM) was confirmed by DNA sequencing. 

ACP and ACPS were co-expressed using the pET16b*-AM vector in C41(DE3) 
E. coli cells and cultured at 37 °C in 1 litre of LB broth supplemented with 100 1g 
ml! ampicillin. Upon reaching an absorbance at 600 nm of 0.6, the expression 
was induced for 6h with 1 mM isopropyl-B-p-thiogalactopyranoside (IPTG) at 
30°C. All the subsequent procedures were performed at 4°C. The cells were 
collected, washed, re-suspended and lysed in 20 mM HEPES, pH 8.0 containing 
10% glycerol, 200mM NaCl and 2mM DTT (buffer A), supplemented with 
20 mM imidazole. After removal of cell debris by centrifugation at 100,000g for 
1h, the soluble fraction was loaded onto a Ni-NTA (Qiagen) column equilibrated 
with buffer A and 20 mM imidazole. The Ni-NTA column was washed with 10 
column-volumes and His; 9-ACP was eluted in one step by buffer A supplemented 
with 250 mM imidazole over the equivalent of five column-volumes. Although 
ACPS was not histidine-tagged, it co-purified with His,o-ACP. The elution fraction 
was allowed to slowly rock with 1 mg of TEV protease and 2 mM EDTA overnight. 
Subsequently, the protein mixture was dialysed against 20 mM MES, pH 6.0 and 
2mM DTT overnight. The TEV protease and residual His,)>-ACP were removed 
by passing the mixture through a 2ml Ni-NTA column. The resulting flow- 
through fraction containing ACP was loaded onto a 5 ml QFF ion exchange col- 
umn (GE). ACP was eluted separately from ACPS by applying a linear gradient of 
20mM MES, pH 6.0, 0-500 mM NaCl and 2 mM DTT over 50 column-volumes. 
The fractions corresponding to ACP were pooled together and dialysed against 
100 mM Tris-HCl, pH 7.1 and 200 mM NaCl (buffer B) overnight. Holo-ACP was 
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separated from residual apo-ACP by mixing the ACP sample with 5ml of 
ThioPropyl Sepharose 6b resin (Sigma-Aldrich) overnight. Apo-ACP, which lacks 
any free thiol group, did not bind to the resin, whereas holo-ACP covalently 
attached to the resin by the terminal thiol group of its phosphopantetheine moiety. 
After washing the Thiopropyl Sepharose 6b with 25 ml of buffer B, holo-ACP was 
eluted in 25 ml of buffer B supplemented with 25 mM DTT. The complete removal 
of apo-ACP was confirmed by electrospray ionization mass spectrometry. The 
elution fraction was concentrated to approximately 4-10 ml and loaded onto a 
High Load 26/60 Superdex 200 gel filtration column equilibrated with 10 mM Tris- 
HCl, pH7.5, 200mM NaCl and 2mM DTT. The relevant eluted fractions were 
concentrated to approximately 20 mg ml and stored in aliquots at —80 °C. 
Production of B-OH-C14-ACP. Holo-ACP was charged enzymatically with R-3- 
hydroxymyristic acid (Santa Cruz Biotechnology) by soluble acyl-ACP synthetase 
from V. harveyi”. The soluble acyl-ACP synthetase (aasS) gene was synthesized by 
GenScript. The aasS gene was subcloned into pET-16b expression vector, over- 
expressed, and the His,-AasS was purified using nickel affinity chromatography as 
reported by Jiang et al.°°. To generate B-OH-C14-ACP, 0.1 mM of holo-ACP was 
mixed with 0.001 mM AasS and 0.3 mM of the fatty acid (B-OH-C14) at room 
temperature for 3 h in a buffer containing 100 mM Tris-HCl, pH 7.8, 10 mM ATP 
and 10mM MgCl). To separate acyl-ACP from AasS, the reaction mixture was 
loaded onto a High Load 26/60 Superdex 200 gel filtration column equilibrated 
with 10mM Tris-HCl, pH 7.5 and 200mM NaCl. The complete conversion of 
holo-ACP to B-OH-C14-ACP was confirmed both by electrospray ionization mass 
spectrometry and 2.5 M urea (19%) polyacrylamide (pH 9.5) gel electrophoresis”. 
Crystallization and structure determination. Before crystallization, either acyl- 
ACP or holo-ACP was mixed with the wild-type or the catalytically inactive His239Ala 
LpxD to preform the protein-protein complex. Crystals of holo-ACP-LpxD were 
grown at 15 °C by mixing the protein solution with the precipitant (0.1 M MES pH 
6.0, 0.2 M lithium sulphate, 20% PEG 4000) in ratios of 1:1 and 1:1.5. Crystals 
achieved full size in approximately 45 days. The crystals were transferred to a cryo- 
solution using a 50:50 ratio of paratone to mineral oil and immediately cryo-cooled 
to —180 °C in liquid nitrogen. Hydrolysed-acyl-ACP-LpxD crystals were obtained 
by equilibrating the protein mixture against a well solution containing 0.1 M MES 
pH 6.5, 0.2 M ammonium sulphate, 20% PEG 8000 and incubating at 15 °C. The 
crystals were collected on day 10 and cryo-cooled using a solution containing 
ammonium sulphate, 33% PEG 8000, 5mM Tris-HCl pH 7.5, 190mM NaCl 
and 20% of the cryoprotectant ethylene glycol. To trap the intact-acyl-ACP- 
LpxD complex the His239Ala-LpxD mutant was purified in the presence of the 
reducing agent TCEP instead of DTT in an attempt to reduce hydrolysis of the 
thioester bond. His239Ala-LpxD and acyl-ACP were pre-mixed using a 1:1 molar 
ratio before crystallization trials. The well solution and the cryoprotectant were the 
same as that described for the hydrolysed-acyl-ACP structure; however, a ratio of 
1.5:1 of protein to well solution was required. 

The diffraction data were collected on the SER-CAT 22-BM and 22-ID beam- 
lines at the Advanced Photon Source at Argonne National Laboratory. Data were 
processed using the HKL2000” software suite (Supplementary Table 2). Although 
the data for both intact- and holo-ACP complexes were processed to 2.1 and 
2.13 A, respectively, the hydrolysed-acyl-ACP complex was trimmed to 2.9A 
owing to the data completeness being unsatisfactory in higher resolution bins (that 
is, well below 70%). This resulted in a higher signal-to-noise ratio in the 2.9A 
resolution bin as well as a much lower Ryerge Value. The crystals of each complex 
belong to the P1 space group. The structures were solved by the molecular replace- 
ment method using the program PHASER in the PHENIX software suite**** and 
the previously determined E. coli LpxD structure (Protein Data Bank 3EH0)”” as 
the search model. Two trimers of LpxD were observed in the triclinic unit cell for 
the intact-acyl-ACP, hydrolysed-acyl-ACP and holo-ACP co-crystal structures. In 
each case, however, ACP was intentionally omitted during the molecular replace- 
ment process; instead, it was manually rebuilt into unbiased, contiguous F, — F- 
difference electron density by first rigid-fitting the E. coli apo-ACP coordinates 
(Protein Data Bank 1T8K)*° into the resulting maps. The models were rebuilt 
using COOT” and iterative structure refinement with restrained and TLS options 
was performed using PHENIX*’. For the lower-resolution hydrolysed-acyl-ACP 
structure, hydrogens were included for refinement with automated optimization 
of X-ray/stereochemistry and ADP weights selected. Additionally, we used the 
high-resolution E. coli apo-ACP structure (Protein Data Bank 1T8K) asa reference 
model. The molecular coordinates and restraints of the 4’-PPT, B-OH-C14-4’- 
PPT and free B-OH-C14 fatty-acid ligands were generated by using either the 
Dundee PRODRG? Server” or PHENIX Elbow”. Composite omit map and simu- 
lated annealing omit map calculations were conducted using CNS*. The protein- 
ligand interactions were identified by AREAIMOL calculations in the CCP4 
suite”. The quality of the final models was validated using MOLPROBITY™. 
The data statistics are reported in Supplementary Table 2. Molecular figures were 
generated using PYMOL”". 
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A total of six ACP molecules were present in the intact-acyl-ACP structure 
(Supplementary Fig. 3). Both the hydrolysed- and holo-ACP structures showed 
partly bound states of ACP within two or three LpxD active sites as interpreted by 
the lack of contiguous electron density for most of the ACP backbone. As a result, 
only a portion of ACP that includes Ser 36 and its 4’-PPT prosthetic group was 
included in the final model: (1) hydrolysed-acyl-ACP, residues 35-44 of chain L, 
residues 6-15, 27-53 and 62-73 of chain I, residues 1-15 and 27-73 of chain G; (2) 
holo-ACP, residues 35-44 of chain H and chain L. The remaining LpxD active 
sites in the hydrolysed- and holo-ACP structures contain fully modelled ACPs. 
In all three co-crystal structures, the electron density maps indicated that the 
N-terminal methionine of ACP was present and forms a key lattice contact with 
neighbouring molecules of LpxD. Two more residues (Ser-His) in the holo-ACP 
structure that remain from the TEV cleavage site could also be modelled. 

The observed electron density for fully modelled intact-, hydrolysed- and holo- 
ACPs were weaker in some regions, especially on the backside of the molecule 
which faces solvent; however, the placement of side-chains was apparent at the 
protein-protein interface (Supplementary Fig. 3). This implicates conformational 
heterogeneity throughout the lattice. Nonetheless, in all three co-crystal structures, 
electron density was apparent in every active site for all atoms of the 4’-PPT group. 
In both the intact- and hydrolysed-forms, pronounced electron density was also 
present for the B-OH-C14 acyl chains located in the N-channel. In addition, 
electron density indicated that the hydrolysed-acyl-ACP complex included two 
extra molecules of B-OH-C14 fatty acid bound to the hydrophobic O-channel, 
although the density was weaker towards the terminal carbon atoms of the acyl 
chains. To investigate the origin of this second fatty acid, we mixed acyl-ACP 
(12.86 mg ml!) ina 1:2.25 v:v ratio with wild-type LpxD (26.38 mg ml!) pre- 
incubated with 1mM DTT in a solution that was consistent with the condition 
used for crystallization. The protein solution was incubated at 15 °C, and aliquots 
were taken at different time points and stored at —80 °C. Samples were run on a 
2.5M urea (19%) polyacrylamide (pH 9.5) gel*', which showed that both DTT 


(a known phenomenon”) and LpxD enhance the cleavage of the thioester bond of 
acyl-ACP (Supplementary Fig. 10). This observation most likely explains why free 
B-OH-C14 fatty acid was available to bind the O-channel of LpxD. 
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Acyl carrier protein (ACP) transports the growing fatty acid chain 
between enzymatic domains of fatty acid synthase (FAS) during bio- 
synthesis’. Because FAS enzymes operate on ACP-bound acyl groups, 
ACP must stabilize and transport the growing lipid chain’. ACPs 
have a central role in transporting starting materials and intermedi- 
ates throughout the fatty acid biosynthetic pathway*°. The transient 
nature of ACP-enzyme interactions impose major obstacles to obtain- 
ing high-resolution structural information about fatty acid biosyn- 
thesis, and a new strategy is required to study protein-protein interactions 
effectively. Here we describe the application of a mechanism-based 
probe that allows active site-selective covalent crosslinking of AcpP 
to FabA, the Escherichia coli ACP and fatty acid 3-hydroxyacyl-ACP 
dehydratase, respectively. We report the 1.9 A crystal structure of 
the crosslinked AcpP-FabA complex as a homodimer in which AcpP 
exhibits two different conformations, representing probable snap- 
shots of ACP in action: the 4’-phosphopantetheine group of AcpP 
first binds an arginine-rich groove of FabA, then an AcpP helical 
conformational change locks AcpP and FabA in place. Residues at 
the interface of AcpP and FabA are identified and validated by solu- 
tion nuclear magnetic resonance techniques, including chemical shift 
perturbations and residual dipolar coupling measurements. These 
not only support our interpretation of the crystal structures but also 
provide an animated view of ACP in action during fatty acid dehyd- 
ration. These techniques, in combination with molecular dynamics 
simulations, show for the first time that FabA extrudes the seques- 
tered acyl chain from the ACP binding pocket before dehydration 
by repositioning helix III. Extensive sequence conservation among 
carrier proteins suggests that the mechanistic insights gleaned from 
our studies may be broadly applicable to fatty acid, polyketide and 
non-ribosomal biosynthesis. Here the foundation is laid for defin- 
ing the dynamic action of carrier-protein activity in primary and 
secondary metabolism, providing insight into pathways that can 
have major roles in the treatment of cancer, obesity and infectious 
disease. 

In E. coli, AcpP interacts with at least 12 enzymes involved in fatty 
acid biosynthesis, plus nine other enzymes from disparate biosynthetic 
pathways (Fig. 1a and Supplementary Fig. 1)°'°. AcpP sequesters grow- 
ing metabolites in an interior hydrophobic cavity that protects these 
intermediates from non-selective reactivity”, and selective protein- 
protein interactions are believed to be a prerequisite for the delivery 
of ACP-bound substrate to its catalytic partners’. Given the import- 
ance of ACP-protein interactions in metabolism and cell regulatory 
processes, understanding this ‘switchblade mechanism’ (Fig. 1c) is 
crucial’, although this has proven elusive owing to the inherently 
transient nature of ACP-partner complexes’. 

We recently deployed synthetic probes to study ACP activity and 
protein-protein interactions", including a sulphonyl-3-alkyne-based 
probe (1, Fig. 1b) designed to capture ACP in functional association 


with 3-hydroxyacyl-ACP dehydratase with demonstrated specificity 
between E. coli AcpP and FabA (Fig. 1c, d)'*"®. Probe 1 applied to AcpP 
and FabA creates a uniformly crosslinked species (AcpP-FabA) that 
forms reproducible crystals in tag-free form (Supplementary Fig. 4). 
No AcpP-FabA protein complex crystals form without 1, demonstrat- 
ing the necessity of applying probes such as 1 to capture ACP in action. 

The AcpP-FabA crystals diffracted to 1.9 A (Supplementary Table 2), 
and we solved the AcpP-FabA crystal structure by molecular replacement 
using the apo-FabA dimer (Protein Data Bank (PDB) code 1MKA)’” 
and two butyryl-AcpP (PDB 2K94) as search templates (Supplemen- 
tary Fig. 5). Final refinement revealed the structure of a one-to-one ratio 
between AcpP and FabA yielding a AcpP,-FabA, complex (Fig. 2a), 
consistent with protein-sizing data in solution. The dimeric FabA forms 
a ‘double hotdog’ topology"®, with two antiparallel ‘hotdog’ helices sur- 
rounded by a combined 14-stranded [-sheet (Fig. 2a)’. The two AcpP 
monomers adopt a four-helix bundle fold’ and dock helices II-III with 
the B5-6 loop of FabA (Fig. 2d). The contact area is small (503 A? and 
539 A? for the first and second AcpP-FabA interface), consistent with 
the transient nature of AcpP-partner interactions. AcpP has a nega- 
tively charged cleft between helices II and III, which interacts with a 
positively charged arginine-rich patch on FabA, the ‘Positive Patch’, 
Fig. 2c)’. The AcpP-FabA interface interactions are primarily electro- 
static but also include conserved, hydrophobic residues (Fig. 2d, 
detailed in Supplementary Information). The high sequence conser- 
vation of negatively charged residues on helices II and III of AcpP at 
the AcpP-FabA interface (Supplementary Figs 2, 3) is consistent with 
previous reports of ACP-partner complex structures (Fig. 4d)’?°”’, 
strongly supporting the presence of the ‘Positive Patch’ in ACP partner 
proteins. 

Only the position of R137 differs between the two FabA protomers, 
but in the ACP structures many residues of helix III move extensively 
(Fig. 2d and Supplementary Information), resulting in different topo- 
logy near the contact interface between helices II and III (Fig. 2c and 
Supplementary Information). Thus the second AcpP-FabA interaction 
probably represents a snapshot when AcpP completes its docking with 
FabA, resulting in less disorder of AcpP2. Accordingly, the first AcpP- 
FabA interaction would represent a snapshot of AcpP in transition, 
where extensive movement of helix III is necessary in order for AcpP1 
to bind or dissociate from FabA. 

The natural FAS substrates contain both 4’-phosphopantetheine 
(PPant) and acyl-chain moieties (Fig. 1b), and the application of probe 1 
shows how both bind to FabA (Supplementary Figs 6, 7). Probe 1 cova- 
lently connects the active site S36 of AcpP and H70 of FabA and binds 
ina highly conserved tunnel of FabA (detailed in Supplementary Infor- 
mation). Unlike acyl-AcpP structures that contain a hydrophobic interior 
pocket to sequester the acyl chain’’, the AcpP in the AcpP-FabA complex 
structure contains no interior pocket (Fig. 2e) and closely resembles 
apo-AcpP (Supplementary Table 4)**, because five conserved hydrophobic 
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Figure 1 | E. coli AcpP and crosslinking strategy. a, AcpP is a small, acidic 
protein comprised of four o-helices that interacts with at least 21 catalytic 
enzymes, 12 of which belong to FAS (10 shown here). The apolar interior of 
helix II (42) and helix III (3) form a hydrophobic cavity that sequesters the 
growing metabolite attached to the PPant arm. b, A native substrate of FabA 
(top) and modified AcpP with targeted sulphony1-3-alkynyl crosslinking probe 


residues between helices II and III move inward and collapse the inter- 
ior pocket (Fig. 2f). This marked change reflects a dynamic AcpP mov- 
ing from the sequestered-substrate state to the open switchblade state 
to position the substrate within the partner enzyme FabA (Fig. Ic, d). 

Further characterization of the dynamic interactions between AcpP 
and FabA was achieved through comparisons with two-dimensional 
1H/1°N heteronuclear single quantum coherence (HSQC) spectra of 
holo-AcpP and octanoyl-AcpP. Many resonances displayed chemical 
shift perturbations (CSPs) in residues that define the hydrophobic pocket 
of AcpP (Supplementary Fig. 11). Titrating unlabelled apo-FabA into 
each sample allowed us to observe CSPs resulting from dynamic, non- 
covalent association with FabA in solution (Fig. 3a, b). In the holo- 
AcpP-FabA titration experiment, we observed significant CSPs in residues 
spanning helices II and III and the adjacent loops (Supplementary 
Fig. 12). In the octanoyl-AcpP-FabA titration experiment, additional 
CSPs were observed in residues lining the hydrophobic pocket of AcpP 
(Fig. 3c), which we attribute to the translocation of the bound acyl 
chain out of the AcpP pocket into the FabA active site to complete 
the ‘switchblade’ process. 

We then acquired a two-dimensional HSQC spectrum of the AcpP- 
FabA complex in 1:2 stoichiometry, with one AcpP crosslinked to each 
FabA homodimer. When overlaying the HSQC spectra of the AcpP-FabA 
complex with the holo- and octanoyl-AcpP titration data (Fig. 3a, b) 
obtained with experiments incorporating transverse relaxation optimized 
spectroscopy~’, we observed a striking correlation between the CSP 
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(middle), derived from the crosslinking pantetheinamide analogue 1 (bottom). 
c, Proposed mechanism of FabA. Protein-protein interactions between AcpP 
and FabA induce release of the sequestered substrate from AcpP into the active 
site of FabA, where dehydration is catalysed. d, Crosslinking strategy to form 
AcpP-FabA with mechanism-based crosslinking probe 1. 


shifts in the two AcpP species and the HSQC spectrum of the complex; 
the chemical shift of each residue migrates towards the observed chemical 
shift in the crosslinked complex as the concentration of FabA increases. 
The similarities between CSPs of the transient binding event and our 
crosslinked complex (Fig. 3c, f) indicate that the binding conformation 
in the AcpP-FabA complex is truly indicative of the natively bound 
conformation. 

From the crystal structure we identified acidic residues E41, E47, 
E53 and E60 of AcpP that interact with FabA (Fig. 2d and Supplemen- 
tary Fig. 6), and correspondingly observed large CSPs in these residues 
between helices II and III (Fig. 3c—e). In addition, we found large CSPs 
in the hydrophobic helix II residues observed in the binding interface 
of the crystal structure, such as L37, V40 and M44 (Figs 2d and 3 and 
Supplementary Fig. 6). Similarly, crystal structure observations are con- 
sistent with CSP plots for both octanoyl-AcpP with crosslinked AcpP- 
FabA (Fig. 3c-e and Supplementary Table 8): the strong CSPs for A59, 
E60, E41 and E47 correspond with side-chain interactions within the 
Positive Patch of FabA. Large CSPs in $36, L37 and D38 correspond 
with a change in the PPant position as it extends into the FabA. T63 
undergoes considerable rotation when comparing the octanoyl-AcpP 
and the AcpP-FabA complex. Ultimately, these CSP observations both 
complement and corroborate binding observations found in the crystal 
structure. 

To study the detailed dynamics of AcpP and its interaction with 
FabA, we measured residual dipolar couplings (RDCs)”* from weakly 
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Figure 2 | Structure of crosslinked AcpP-FabA. a, X-ray crystal structure of 
AcpP-FabA at 1.9 A. b, The molecular surface mapped with calculated vacuum 
electrostatic potential of AcpP-FabA. Blue shading indicates electropositive 
and red shading indicates electronegative protein surfaces. c, Rotation of b by 
90° at the interfaces between each AcpP-FabA to visualize electrostatic pairing. 
d, Expanded view of both interfaces in AcpP-FabA, indicating salt bridges 
and hydrophobic interactions between helix II (#2) and helix III («3) of AcpP 


aligned samples of octanoyl-AcpP and AcpP in the AcpP-FabA com- 
plex (Fig. 4a, b). The empirical RDCs were combined with accelerated 
molecular dynamics simulations” to provide structural-dynamic informa- 
tion in the slow RDC-optimized (microsecond) regime (detailed in Sup- 
plementary Information)”. Within this framework, we identified the 
optimal acceleration parameters, and hence optimal conformational 
space sampling criteria for the correlation of experimental and theor- 
etical RDCs (Fig. 4a), and calculated the averaged NH order parameters 
at both fast (nanosecond) and slow (microsecond) timescales (Fig. 4b). 
On nanosecond timescales, no substantial differences in the order para- 
meters between octanoyl-AcpP and AcpP-FabA are observed. By con- 
trast, in the microsecond time regime, substantial differences in the 
structural-dynamic behaviour of octanoyl-AcpP and AcpP-FabA are 
identified, indicating that AcpP is markedly stabilized in the presence 
of FabA, especially in the amino-terminal region of helix II and the 
helix II-III loop. There is a notable correlation between the AcpP-FabA 
binding interface observed in the crystal structure (Fig. 2), the NMR 


and the Positive Patch of FabA. e, Comparison between hydrophobic cleft 
of AcpP with sequestered substrate (top, from PDB 2FAE, with long interior 
hydrophobic cavity outlined with dashed line) and AcpP1 in AcpP-FabA 
(bottom, reduced interior cavity). f, The interior cavity of 2FAE labelled with 
the hydrophobic residues. The contraction of these hydrophobic residues 
collapses the interior cavity in AcpP-FabA. 


titration data (Fig. 3) and molecular dynamics simulations (Fig. 4b, c), 
all highlighting key dynamic residues. 

These results provide a window into the dynamic properties of AcpP, 
which sequesters elongated substrates in its interior cavity with motion 
at the helix II-III loop on a microsecond timescale. A probable order 
of events is that the Positive Patch of FabA first interacts with PPant 
attached to S36 of AcpP. Once in proximity, residues R132 and K161 of 
FabA form salt bridges with E41 and E47 on helix II of AcpP, anchoring 
the complex, whereas R136 and R137 serve to pry away helix III of AcpP 
through interactions with A59 and E60, thus disrupting shielding of the 
sequestered substrate. V40 and L37 on AcpP form hydrophobic inter- 
actions with L138 and V134 on FabA. All of these binding events serve 
to stabilize AcpP in an open conformation, allowing the sequestered 
substrate to release from AcpP and insert into the pocket of FabA as the 
AcpP hydrophobic cavity collapses. Together, these results provide an 
unprecedented verification of the switchblade mechanism (Fig. 1c). 
We surmise that the identity of the sequestered substrate can affect the 
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Figure 3 | NMR studies. a, HSQC titration spectra of "N-octanoyl-AcpP in equivalent of FabA and plotted by residue number. d, AcpP residues from 
the absence of FabA (green), and with increasing (yellow to red) concentrations __c, where CSPs larger than 0.065 parts per million (p.p.m.) are indicated in red. 
of FabA. CSPs are observed in AcpP residues that interact with FabA or e, CSPs measured between '°N-octanoyl-AcpP and '°N-AcpP-FabA were 
the bound acyl chain. In magenta is the overlaid HSQC of crosslinked '°N- measured and plotted by residue number. f, AcpP residues from e, where CSPs 
AcpP-FabA. b, Expanded views of select residues. c, CSPs were measured for _ larger than 0.25 p.p.m. are indicated in red. In NMR convention, protein residue 
each '°N-octanoyl-AcpP residue in the absence and presence of 1 molar number precedes residue letter; the converse applies with crystallography. 
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Figure 4 | Molecular dynamics and protein-protein interactions. octanoyl-AcpP and AcpP-FabA. Colour and thickness depict relative disorder, 
a, Experimental RDC data correlated with theoretical RDCs. !°N-octanoyl- where red represents maximal difference of 0.5 (detailed in Supplementary 
AcpP with Pf1 bacteriophage (left, black) and 5% neutral charge compressed _—_ Information). d, Residues of the Positive Patch mediating protein-protein 
polyacrylamide gel (left, red), and crosslinked AcpP-FabA with Pfl interactions in known structures. Blue, FabA; cyan, ACP-ACP synthase (PDB: 
bacteriophage (right, magenta). b, Order parameter calculations of octanoyl- 4DXE); purple, ACP-stearoyl-ACP desaturase (PDB 2XZO0); orange, holo 
AcpP (left) and AcpP-FabA (right). Nanosecond (blue) timescale compared ACP-ACP synthase (PDB 1F80); magenta, ACP-P450 (PDB 3EJB); yellow, 
to microsecond (RDC-optimized) (dotted) timescale routines. c, Sausage ACP-STAS (PDB 3NY7); and green, ACP-BioH (PDB 4ETW). 

plot of order parameter differences on the microsecond timescale between 
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positioning of helix II and helix III of AcpP, thereby modulating suc- 
cessful binding and switchblade events for selective catalysis. 

The application of crosslinking probes to gain structural insights now 
lays a foundation for defining the dynamic events associated with the 
mechanism of action of ACP. The approach can be applied to other 
carrier protein partners from primary and secondary metabolism such 
as FAS, polyketide synthase and non-ribosomal peptide synthetase*””””, 
as well as other carrier-protein-dependent pathways that have major 
roles in the treatment of cancer, obesity and infectious disease (Sup- 
plementary Table 9). 


METHODS SUMMARY 


All proteins used were overproduced in E. coli BL21(DE3) (Novagen) and purified 
by Ni-affinity followed by fast protein liquid chromatography. The AcpP-FabA 
complex was generated as reported previously and crystallized at room temper- 
ature (25°C) by sitting drop vapour diffusion at 30mgml' in 10mM sodium 
phosphate (pH 8.0), 350mM sodium acetate, 1 M LiCl and 35% PEG3350. Data 
were collected on beamline 12-2 at the Stanford Synchrotron Radiation Lightsource 
and beamline 8.2.2 at the Advanced Light Source and processed with HKL2000. The 
AcpP-FabA crystallographic phases were determined by molecular replacement 
using FabA and AcpP as the search template. Protein NMR data were collected 
at the University of California, San Diego Biomolecular NMR facility. Details of 
the molecular dynamics simulations are included in Supplementary Discussion. 
Detailed experimental procedures are described in the Supplementary Methods. 
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Prokaryotic viruses have evolved various mechanisms to transport 
their genomes across bacterial cell walls’. Many bacteriophages use 
a tail to perform this function, whereas tail-less phages rely on host 
organelles” *. However, the tail-less, icosahedral, single-stranded 
DNA ®X174-like coliphages do not fall into these well-defined infec- 
tion processes. For these phages, DNA delivery requires a DNA pilot 
protein’. Here we show that the ®X174 pilot protein H oligomerizes 
to form a tube whose function is most probably to deliver the DNA 
genome across the host’s periplasmic space to the cytoplasm. The 
2.4 A resolution crystal structure of the in vitro assembled H protein’s 
central domain consists of a 170 A-long a-helical barrel. The tube is 
constructed of ten a-helices with their amino termini arrayed in a 
right-handed super-helical coiled-coil and their carboxy termini 
arrayed in a left-handed super-helical coiled-coil. Genetic and bio- 
chemical studies demonstrate that the tube is essential for infectivity 
but does not affect in vivo virus assembly. Cryo-electron tomograms 
show that tubes span the periplasmic space and are present while the 
genome is being delivered into the host cell’s cytoplasm. Both ends 
of the H protein contain transmembrane domains, which anchor 
the assembled tubes into the inner and outer cell membranes. The 
central channel of the H-protein tube is lined with amide and gua- 
nidinium side chains. This may be a general property of viral DNA 
conduits and is likely to be critical for efficient genome transloca- 
tion into the host. 

Bacteriophage ®X174 is a small icosahedral microvirus with a cir- 
cular single-stranded (ss) DNA genome®. Electron microscopic and crys- 
tallographic studies have shown that the virions have spikes on each 
pentameric vertex’. The capsid, consisting of 60 F proteins, has an exter- 
nal diameter of about 260 A. Each of the 12 spikes consists of five G 
proteins, which protrude 32 A above the F-protein shell. The capsid 
also contains 60 copies of the DNA-binding protein J and 10-12 copies 
of the DNA pilot protein H®. Although a small portion of each H protein 
may be in the hydrophilic channel formed by the G proteins’, the struc- 
ture and location of the H proteins remained unknown as all structure 
determinations assumed icosahedral symmetry. The mature ®X174 
virion is devoid of a visible external tail. 

Capsids of tailed double-stranded (ds)DNA bacteriophages have a 
special vertex occupied by a dodecameric portal protein. Myoviruses, 
such as phage T4, have a contractile sheath surrounding the tail tube 
that contracts after the virus detects a host cell surface, thereby causing 
the specialized tip of the inner tail tube to puncture the outer membrane. 
The tail-associated lysozyme then digests the peptidoglycan layer before 
a tube component punctures the inner membrane. The genome then 
passes through the tail tube into the cytoplasm’. Podoviruses, such as 
T7, €15 and P22, eject internal head proteins through their portals that 
may also digest the peptidoglycan layer in the cell wall. Afterwards, they 
either assemble translocation tubes or lengthen their existing short tail 
tubes to stretch across the periplasmic space’*"’. By contrast, filamentous 


and tail-less icosahedral bacteriophages, with the notable exception of 
the microviruses, evolved to use host-cell-encoded channels**. 

The ®X174 DNA pilot protein H guides the DNA through the cell 
wall in the penetration process’. The genome-associated H proteins are 
transported to the cytoplasmic membrane, whereas the coat and spike 
proteins remain outside the cell'*. Because ®X174 does not have a tail, 
Jazwinski and colleagues speculated that the H proteins might form 
a DNA translocating channel through the cell wall’*. The H protein 
contains a predicted N-terminal transmembrane helix, rich in the 
(G/A)XXX(G/A) sequences known to promote membrane protein 
oligomerization’*”*. The protein’s central core has a potential coiled- 
coil structure’®. 

In attempts to crystallize the H protein (328 amino acids) it was 
necessary to remove some of the N- and C-terminal residues. The first 
successful structure solution was ofa fragment stretching from residues 
143 to 211 (143H211), which was determined by single-wavelength 
anomalous diffraction phasing. The structure consists of ten a-helices 
organized into an o-helical barrel. Using this as a model, a second longer 
structure (143H282) was determined by molecular replacement (Sup- 
plementary Table 1). Like the 143H211 structure, the structure of 143H282 
consists of ten nearly identical o-helical monomers that form a cylin- 
drical helical barrel. The cylinder is 170 A long and has an external 
diameter of approximately 48 A. The a-helices run parallel along the 
entire cylinder. Each monomer is kinked at residues Y193-A194-Q195, 
which divides the barrel into N- and C-terminal domains (Fig. 1a). 
Domain A (residues 144-194), which is very slightly conical, has a mini- 
mum internal diameter of approximately 22 A, whereas the internal 
diameter of domain B (residues 195-272) is approximately 24 A (Fig. 1b). 
The inner surface of the H tube has mostly negative electrostatic poten- 
tial (Fig. 1c). To our knowledge, this is the first description of a deca- 
meric coiled-coil structure’’. 

Most coiled-coil structures are dimers or trimers. The interaction 
between helices is determined by the insertions of side chains, or knobs, 
from one helix into the holes formed between side chains of the sur- 
rounding helices’*. In a structure with more than four identical helices, 
each helix interacts only with its two nearest neighbours. As the number 
of participating helices increases, the diameter of the central cylindrical 
cavity expands. This results in an empty tube with a water-accessible 
cavity, as opposed to the interacting hydrophobic surfaces observed in 
typical coiled-coil structures'’. In the H-protein tubes, the interaction 
between neighbouring helices is dominated by hydrogen bonds (Sup- 
plementary Table 2). 

The number of residues in one turn of an o-helix is non-integral and 
is expressed as the number of residues (n) in a given number of com- 
plete turns (t). Thus, the number of residues in a single helical turn is 
n/t. A conventional straight o.-helix has 3.6 residues per turn”’. How- 
ever, if n/t> 3.6 the helix forms a right-handed superhelix, and if 
n/t < 3.6 the superhelix is left-handed”’. In the present structure, the 
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Figure 1 | The structure of the H-protein coiled-coil tube. a, Ribbon 
representation of the structure. The monomers are individually coloured. 

b, Top view of the structure with the N terminus closest to the viewer. 

c, The inner surface of the H-protein tube is coloured according to the 
electrostatic potential. Blue and red colours correspond to positive and negative 
potential of 5kT/e’, respectively. d, The top view of the tube surface 

(the orientation of the tube is the same as in b). e, f, Superposition of the 
H-protein tube (cyan) and the bacteriophage fd capsid (purple), showing that 
the H protein has a similar inner diameter to fd (Protein Data Bank accession 
number 2HI5)*’. 


at-helices have 11 residues in three turns (11/3) in domain A and seven 
residues in two turns (7/2) in domain B. The inclination of the «-helices 
to the central cylinder axis in domains A and B is 5° right-handed and 
36° left-handed, respectively (Fig. 1a). 

In domain A, the 11 residues per structural repeat are identified by 
the letters a to k (Fig. 2b). The residue that has its side chain pointing 
towards the centre was arbitrarily designated residue a. Within the struc- 
tural repeat, the side chain of residue h would be similarly oriented. 
Seven of the eight residues at positions a or h are glutamines. In domain 
B there are seven residues per turn identified by the letters a to g (Fig. 2a). 
Only one amino-acid side chain within the heptad repeat points exactly 
towards the tube’s centre; again, this is designated as position a. How- 
ever, the long side chains of residues d and e may also assume inward 
orientations (Fig. 2c). Most of the inward-facing residues are glutamines. 
A few of them are asparagine and arginine residues. The occurrence of 
glutamine, asparagine and arginine residues that point towards the 
centre of the cylinder is also conserved in the homologous pilot protein 
of other microviruses (for example, coliphages «3 and G4) as well as 
the distantly related gokushoviruses (for example, Bdellovibrio phage 
@®MH2K and Chlamydia phage Chp2). These long side chains point to 
the N termini, which may suggest that the DNA is transported from 
the C to N termini. Like an outside-in harpoon, the side chains, which 
point to the N terminus of an o-helix, may prevent the DNA from 
going backwards. 

During assembly the external scaffolding protein D organizes twelve 
12S* particles into the procapsid. The 12S* particle consists of five copies 
each of proteins F, G and internal scaffolding B, but only one copy of 
protein H’'. Thus, if H-protein tubes are a component of the viral life 
cycle, some initial H-protein assembly most probably occurs after 
procapsid formation but before genome packaging. If this is correct, 
mutations that inhibit the requisite tube forming helix—helix interactions 
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should not affect procapsid assembly. However, if tube formation is 
essential for penetration of the genome into the host’s cytoplasm, the 
resulting particles should lack infectivity. Using the atomic structure 
as a guide (Fig. 2), residues that make monomer-to-monomer contacts 
were mutated. Mutant 1 (N161S + N187A) contains serine and alanine 
substitutions. Mutant 2 (N161S + N187A + Q224A) contains an extra 
alanine substitution (Supplementary Table 2 and Fig. 2). 

To determine whether the mutant H proteins affect particle forma- 
tion and/or infectivity, particles were purified from cells infected with 
mutant or wild-type viruses. The assembled particles were purified by 
rate zonal sedimentation. Wild-type and mutant virions sedimented at 
the same rate (Extended Data Fig. 1a). The mutant H proteins were 
incorporated into virions at wild-type levels (Supplementary Table 3). 
Mutant particles did not have an altered DNA content, as determined 
by D260 nm/D2g0 nm ratios (Supplementary Table 3). However, the mutant 
particles showed approximately three orders of magnitude lower specific 
infectivity (plaque-forming units/D ¢9 ,) than the wild-type virus (Sup- 
plementary Table 3). 

The ability of the mutant H proteins to form tubes was investigated 
in vitro. Cloning, gene expression and protein purification protocols 
were identical to those used for the wild-type H protein. However, size- 
exclusion chromatography showed that the mutant proteins migrated 
as lower-order oligomers (Extended Data Fig. 1b). The mutant H proteins 
were also probed by limited trypsin proteolysis. Whereas digestion of 
the wild-type 143H282 fragment generated the stable 143H221 frag- 
ment, the mutant proteins were digested into small pieces (Extended 
Data Fig. 1c). This indicated that some lysine and arginine residues, 
which are buried in the wild-type 143H221 fragment, are accessible in 
the mutant proteins, implying the failure of the mutant proteins to 
form tubes. Thus, the biophysical characterization of the mutant part- 
icles in vivo and mutant H proteins in vitro suggest that H-protein olig- 
omerization is critical for infectivity but not for particle morphogenesis. 

The H proteins self-assemble into a tube with dimensions suitable 
for DNA translocation and long enough to span the periplasmic space 
(Fig. 3) or a Bayer’s patch, a membrane adhesion site where infecting 
X174 particles have been suggested to congregate”. Secondary struc- 
tural prediction indicated a transmembrane helix at the N terminus”. 
Both N- and C-terminal regions are rich in alanine, glycine and serine 
residues, which have high occurrence in transmembrane helices (Sup- 
plementary Table 4). These regions may anchor the H tube in the inner 
and outer membranes. Tubular structures are also used for transport- 
ing the genome to the host’s cytoplasm by tailed bacteriophages such 
as T4 (ref. 9) and T7 (ref. 10), although for T4 the tube is attached to the 
head before infection whereas for T7 it is extruded from the head at the 
time of infection. For ®X174 it is unclear whether the H monomers 
begin to assemble as the genome is packaged or whether the tubes are 
assembled from H monomers at the time of infection (Extended Data 
Fig. 2). In the same way that tailed dsDNA bacteriophages use a virally 
encoded ATPase to package genomes into phage heads”, DX174 uses a 
host-cell-derived enzyme to package DNA into the phage procapsids. 
Presumably some of this packaging energy can subsequently be used to 
deliver the genome to the next host cell. 

As the ®X174 genome is a circular ssDNA molecule, the H tubes 
must accommodate two oppositely oriented DNA strands. The pack- 
aged genome contains mostly unpaired bases”. The minimum internal 
diameter of the H tube is approximately 22 A, which can easily accom- 
modate two unpaired ssDNA strands with intercalated bases”®. Simi- 
larly, the circular ssDNA genomes of filamentous bacteriophages (for 
example, M13 and fd) are packaged into cylindrical fivefold symmetric 
capsids (Fig. le, f) with comparable inner dimensions*”’. 

An electron microscopic tomographic investigation was used to detect 
the suspected H tubes crossing the periplasmic space of Escherichia coli 
C during ®X174 infection. However, as normal size E. coli cells are too 
large to allow electrons to cross while forming an image of such a cell, it 
was necessary to use ‘mini’ E. coli K12 cells'’. In turn, that required the 
use of the homologous and closely related microvirus ST-1 (Extended 
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Figure 2 | Helical wheel representations summarizing the inter-helical 
contacts in different parts of the helical barrel. a, The 7/2 heptad repeats of 
domain B. b, The 11/3 hendecad repeats of domain A. The amino-acid 
sequences arranged in 11- and 7-residue repeats are given on the right. 


Data Fig. 3) as DX174 does not infect K12 cells**. The ST-1 particles 
were seen to attach to the surface of the cell after incubating for 6 min or 
more (Fig. 3a-c). About 20% of the attached particles had a tube 
emerging from the capsid and, in most cases, crossing from the outer 
to the inner membrane of the cell (Fig. 3e, f). However, empty attached 
virions had lost the tube after the virus had delivered its genome to the 
host cell (Fig. 3g, h). 

To our knowledge, the H-protein structure represents the first virally 
encoded, cell-wall-spanning, DNA-translocating conduit determined 
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The residues that were mutated to disrupt formation of the H tube are coloured 
blue. Residues facing the interior are coloured orange. The inward-facing 
residues are highlighted in green. c, Stereo view of the residues lining the 
tube centre. 


at atomic resolution. Many bacteriophage protein assemblies that func- 
tion to conduct DNA have similar properties. For example, a domain of 
the P22 portal protein forms a 140-A-long «helical barrel, which shares 
sequence similarity with the ®X174 H-protein coiled-coil domain 
(Extended Data Fig. 4)”°. Although this domain is located within the 
capsid, dsDNA passes through it during DNA packaging and penetra- 
tion of the cell wall. Subsequently, the P22 DNA pilot protein, gp16, 
which contains a predicted coiled-coil domain, transports the DNA across 
the cell wall°®. Similarly, the gp14 and gp15 proteins of bacteriophage 


Figure 3 | Cryo-electron micrographic tomogram of the ®X174-like phage 
ST-1 infecting E. coli mini cells. a—c, Slices of tomograms showing three states 
of the infection process. d-h, Enlarged images taken from a-c. d, The virus has 
attached to the outer membrane (OM). One of the pentameric spikes of an 

icosahedral particle has recognized a lipopolysaccharide (LPS) molecule in the 
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outer membrane of the E. coli cell wall. e, f, After attachment, the virus extrudes 
a tube for DNA penetration. A tube can be seen (white arrow) crossing the 
periplasmic space, lodged in the outer and inner membrane (IM). g, h, After 
DNA has been injected into the cell, the extended tail starts to disassemble. 

i, Schematic model of ®X174 infection. 
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T7 that constitute the trans-cell-wall extended tail tube’! contain 
strongly predicted coiled-coil domains. The amino-acid sequence of 
gp14 is similar enough to that of the H protein to allow a reasonable 
sequence alignment (Extended Data Fig. 5). The spacing of the amide 
containing side chains, which line the H tube’s inner cavity, is similar. 

Like other phages, ®X174 seems to have a ‘tail’ that is required for 
infecting a host, but it protrudes from the virion only at the time of infec- 
tion. Just-in-time proteinaceous or lipidic tail formation also occurs 
when the T7 (ref. 11) and PRD1 (ref. 4) phages infect cells, respectively. 


METHODS SUMMARY 


The H-protein segment, corresponding to residues 143-282 (143H282), was cloned 
and expressed in E. coli. The purified protein was digested by trypsin and a smaller 
stable fragment, 143H221, was further purified. The Se-met 143H221 fragment 
was crystallized and the structure was determined by single-wavelength anom- 
alous dispersion. The structure of the longer crystallized fragment 143H282 was 
then determined by molecular replacement, using the structure of 143H221. Mis- 
sense mutations in the ®X174 H gene were generated by oligonucleotide-mediated 
mutagenesis. Mutated virus was extracted from cells that had been infected with 
mutated ®X174 DNA. The 143H282 protein mutants were cloned, expressed and 
purified in the same way as the wild-type 143H282 protein. Tomographic images 
of the homologous virus ST-1 infecting mini E. coli cells were taken on an FEI Titan 
Krios electron microscope. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature12812 


Corrigendum: A Jurassic 
mammaliaform and the earliest 
mammalian evolutionary 


adaptations 
Chang-Fu Zhou, Shaoyuan Wu, Thomas Martin & Zhe-Xi Luo 


Nature 500, 163-167 (2013); doi:10.1038/nature12429 


In this Article, the “S1-3’ label was misplaced in Fig. 1c, and the legend to 
Fig. 2c should state ‘Labial view’ rather than “Lingual view’. In addition, 
in Supplementary Figs 2d and 3c, the yellow and red labels for upper 
teeth were the wrong way round, and the legend to Supplementary 
Fig. 2 should state ‘labial’ rather than ‘lingual’ view, and from the ‘right 
side of skull in labial view’ rather than ‘left side of skull in lingual view’. 
These changes have been corrected in the HTML and PDF versions of 
the original paper and its Supplementary Information. 
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LIFEIN THE DANGER ZONE 


Instruments for studying microbes under biological containment cannot be readily 
removed from labs for servicing. A US facility is finding ways around that problem. 


A laboratory in Frederick, Maryland, will soon bring a high degree of automation and imaging capability to research under biosafety-level-4 conditions. 


BY VIVIEN MARX 


deadly — natural threats to humans 
and the environment as well as poten- 
tial bioweapons. To counter such hazards, 
laboratories that study these pathogens and 
substances must do so under high security. 
One such lab is opening this spring in Fred- 
erick, Maryland. It will be designated as 
biosafety-level-4 (BSL-4), the highest level of 
biological containment. 
The lab is part of the Integrated Research 
Facility (IRF), a complex operated by the US 


yee bacteria, viruses and toxins are 


National Institute of Allergy and Infectious 
Diseases (NIAID) that has been opening in 
stages since 2008. Although some of the equip- 
ment in the 1,020-square-metre lab is standard 
for a BSL-4 environment, the degree of auto- 
mation and integration is unprecedented, says 
Peter Jahrling, director of the IRE 

For example, a complete imaging suite and 
clinical area lets researchers study and treat 
infected animals without having to remove 
them from containment. “We've basically built 
an intensive care unit for animals,” says Jahr- 
ling. Likewise, the instruments used to screen 
and study blood and tissue samples have been 


customized to minimize the need for handling 
by people. In some cases, almost every step has 
been automated, with robots doing assays to 
check virus concentrations or assess immune 
responses. 

Until the lab receives its final BSL-4 
approval, which is expected early this year, 
its staff is working on slightly less dangerous 
microbes, such as the viruses that cause cow- 
pox and Middle East respiratory syndrome 
(MERS). But the researchers are already 
following BSL-4 procedures — wearing pres- 
surized whole-body suits with air piped in 
through hoses, and taking disinfecting 
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A CUSTOM JOB 


Liquid-handling robots in a biosafety-level-4 lab have to be customized in many ways. 


Air duct 


> chemical showers in the suits when exiting 
the lab. “It allows our staff here to gain practice 
in working with a human pathogen in a BSL-4 
environment,’ says Lisa Hensley, a microbiolo- 
gist and the IRF’s associate research director. 

Once the lab has final approval, Hensley 
and her team will move on to study the Ebola, 
Hendra and Nipah viruses. “We'll be able to 
see what's really happening in animals,’ she 
says, and to assess potential treatments. The 
lab’s unique ability to image live specimens in 
the BSL-4 environment will help researchers to 
cut down on the number of animals killed, and 
will avoid the need for extensive and numerous 
post-mortem examinations, which increase 
the risk to staff. 

But installing robots and complex imag- 
ing equipment in the lab has called for careful 
engineering. 


CLEAN IMAGING 
In 2012, scientists at another NIAID BSL-4 
lab, in Hamilton, Montana, showed that the 
infection that causes MERS in humans could 
be modelled in macaque monkeys (V. J. Mun- 
ster et al. N. Engl. J. Med. 368, 1560-1562; 
2013). The scientists there could do blood 
work on these animals and take X-rays, says 
Hensley, but more detailed imaging data are 
crucial for identifying lung regions affected 
by the disease, pinpointing possible locations 
where inflammation begins and assessing pos- 
sible treatments. “You can look to see if your 
countermeasure is working,’ says Hensley. 
That is why the new IRF lab has a hospital- 
quality imaging suite that includes an X-ray 
imager, a 3-tesla magnetic resonance imaging 
machine and scanners that combine computed 
tomography with positron emission tomog- 
raphy or single-photon emission computed 


Automated 
balance 


Simplified deck 
with fewer devices 


HIGH-SECURITY LABS 


TECHNOLOGY 


A liquid-handling robot designed for use by scientists in bulky protective gear. 


tomography (see ‘Hot and cold’). 

But each piece of equipment has many mov- 
ing parts and electronic components that need 
regular servicing, says Jahrling. “There's no 
way you can put that in a BSL-4 lab and expect 
it to function for very long,” he says. And ina 
lab with such high biosafety requirements, “the 
service people aren't coming in’, says Hensley. 
The equipment lives in containment and can 
leave only if thoroughly decontaminated, yet 
no instrument can handle a chemical shower. 
The alternative — treatment with vaporous 
formaldehyde or hydrogen peroxide — can 
also cause damage. And even if the instru- 
ment were to survive, the process would be 
time-consuming. 

So before the IRF scientists installed imaging 
equipment in the BSL-4 lab, they collabo- 

rated with engineers 


“The more at Philips Health- 
things youaddto care, which has its 
arobot,themore US headquarters in 
things can go Andover, Massachu- 
wrong. » setts, to completely 


re-engineer the 
instruments. “That was a huge research and 
development effort,’ says Jahrling. 

The main sections of the imaging machines 
are installed on one side of a wall — the ‘cold’ 
or non-contained side. On the ‘hot’ side is a 
patient table onto which an anaesthetized 
animal is placed for imaging. The table is 
mounted on tracks and can be moved through 
a secure transparent tube that extends into the 
cold side of the lab but is sealed to prevent the 
escape of pathogens. Researchers then slide the 
table through the tube to the imaging appara- 
tus on the cold side. This design allows service 
staff to maintain the instrument outside the 
contained lab. 


Four liquid-handling robots have been 
installed at the IRE, two of which are in the 
BSL-4 lab. The instruments are used to extract 
DNA and RNA from tissues and to perform 
assays that involve titration and staining. The 
robots cap and uncap tubes, weigh them and 
dispense various reagents. 

In plaque assays, used for calculating the 
concentration of virus in a sample, virus is 
added to a multiple-well plate that contains 
cells and culture medium. After a certain 
amount of time, visible holes or ‘plaques’ form 
where the virus has infected and killed cells. 
The assay requires multiple steps that include 
serial dilution, weighing samples and adding 
buffers. 

In neutralization assays, used for assessing 
immune responses, scientists can see how well 
antibodies interfere with an infectious agent. 
When robots tend to these assays, there is less 
variability than when a number of different 
staff members perform the task, says Hensley. 
Less handling of infectious and toxic agents is 
also safer for scientists, she says. 


SIMPLIFIED ROBOTS 

Automation brings its own problems. For 
example, Jahrling says, “there's a concern that 
the repetitive manipulations that the robot 
performs might generate an aerosol” — tiny, 
airborne droplets that can spread infectious 
agents. To reduce the risk, some instruments 
were enclosed and tested to ensure that perfor- 
mance was not impaired. 

Kenny Ung, an engineer at Tecan, a 
company based in Mannedorf, Switzerland, 
and his colleagues worked with the IRF team 
to customize Tecan’s Freedom EVO liquid han- 
dlers for use in the lab. The robots are enclosed 
by safety shields. 
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Tecan customizes instruments for com- 
plex workflows in which robotic arms move 
between carousels, bar-code scanners, incu- 
bators, shakers, sealers and other devices. 
But the machines needed to be simplified 
and customized for use in this lab. Given the 
space constraints and difficulty servicing 
instruments, Ung says, “you don’t want one of 
those types of systems inside BSL-4”. But the 
IRF team wanted some degree of automation 
in their assays so that many samples can be 
prepared at once. “A robot doesnt get bored, 
doesn’t get tired,” says Ung. 

In the IRF’s BSL-4 lab, robots are pro- 
grammed for sample preparation with fewer 
additional devices and with as little human 
intervention as possible (see ‘A custom job’). 
A robotic arm will pick up tubes containing 
tissue or blood samples, move them to an 
automated balance, uncap and cap them and 
add buffer as needed. Another robot dispenses 
liquids for serial dilution of samples, which are 
then transferred to 96-well plates to be incu- 
bated and analysed. 

Tecan’s engineers customized the liquid 
handler to deal with this succession of tasks, 
but kept the robot simple. They added an auto- 
mated balance and a bar-code reader to scan 
and keep track of the sample tubes, but decided 
not to include a sample-mixing sonicator — 
instead, the IRF scientists use an external 
mixer. “The more things you add to a robot, 
the more things can go wrong,” Ung says — 
which is true especially in a BSL-4 environ- 
ment, where repair needs must be kept to a 
minimum. 

Ung knew that scientists would be oper- 
ating the instrument while wearing protec- 
tive suits, so he decided to simulate their 


HOT AND COLD 


The imaging suite at a US lab in Frederick, Maryland, has a ‘hot' and a 'cold' side for biosafety-level-4 research. 


situation. “I wanted to feel what it’s like to be 
the user inside the lab,” he says. The IRF team 
sent him a safety suit to wear while he worked 
on the machine. 

One of Ung’s tasks was to ensure that the 
equipment had no sharp edges that could 
puncture the suit. In 
addition to install- 
ing safety shields 
on the instrument's 
sides and tops, he 
filed and sanded the 
edges on the instru- 
ment and on its 
aluminium stand. 

Healso elevated the 
instrument slightly 
and added a drip 


“You just pan so that scientists 
getused to : could capture and 
disconnecting clean spills easily. He 
your air and and his colleagues 
walking ten added air ducts, 
feetdowntothe adjusting the air flow 
nextairlineand _ to be strong enough 
reconnecting. a to remove potentially 


pathogen-laden aero- 
sols but not so strong 
as to disrupt liquid-dispensing or weighing. 
Increased air flow can prevent a balance from 
stabilizing, which would lead to flawed readings. 

Before installing the machine, the IRF team 
visited Tecan’s labs to verify that it worked as 
specified. The instrument was then moved to 
the BSL-4 lab, where it was reassembled and 
tested. Tecan worked out a maintenance plan 
for the instrument. “I’m not able to go in there, 
so they basically need to have skills on their 
side to remedy the problem,” says Ung. Cindy 


Peter Jahrling 


Patient table slides 


through the wall 


=| Hot (pathogens present) 


Allan, a biomedical engineer at the IRE, has 
learned how to take the robot apart and put 
it back together. She also learned the tricky 
task of changing the pipette tips, diluters and 
syringes. Biothreat analysis calls for genetic 
and protein-based tests, says Amy Altman, 
who directs biodefence at Luminex, an instru- 
ment manufacturer in Austin, Texas. Anti- 
body-based probes are used to detect proteins 
such as ricin or Clostridium botulinum toxin, 
for example. And some viruses can have very 
low concentrations in the blood, so detecting 
them requires sensitive genomic tests. 


FAST FLOW 

In the IRF’s BSL-4 lab, researchers will be using 
Luminex’s FlexMAP 3D, which is based on 
flow cytometry, a technique in which lasers 
are used to count cells. In this instrument, 
specific types of genetic material or protein are 
attached to beads coloured with multiple fluo- 
rescent dyes at different ratios. The beads move 
in single file past a laser that emits red light 
to excite the dyes. The emitted wavelength of 
light identifies the type of bead. A green laser 
then excites a ‘reporter’ dye in the bead that 
determines which protein or nucleic acid it is 
attached to. “As each bead gets interrogated by 
the laser, you can think of each bead as its own 
little test tube,” says Altman. 

To screen a sample for ten strains of Ebola 
virus, for example, researchers use an array of 
beads that each carry snippets of DNA specific 
to one of the strains. A blood sample from a 
patient who succumbed to Ebola can therefore 
be tested to determine which strain caused that 
person's death. 

The platform works quickly and can dis- 
tinguish between up to 500 genes or proteins 


Cold (pathogens not present) 


Containment tube to 
image infectious animals 


1 | Single-photon emission computed tomography and computed tomography (SPECT-CT) 3 | X-ray imaging 
| 4 | Positron emission tomography and computed tomography (PET-CT) 


B Magnetic resonance imaging (MRI) 
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simultaneously. Such ‘multiplexing’ means that 
scientists can check for many signatures of a 
possible infectious agent at once, Altman says; 
a ‘true positive’ for a pathogen would be when 
all signatures of the disease agent are present. 
Multiplexing platforms are used in biothreat 
monitoring, which sometimes entails “high- 
consequence, high-regret decisions”, such as 
whether or not to shut down an airport, says 
Altman. 

To ensure that the hardware could operate in 
the IRF’s BSL-4 lab, Luminex scientists tested 
whether the instrument could be safely decon- 
taminated with vaporous hydrogen peroxide 
and paraformaldehyde. Neither substance is 
usually kind to instruments, but in the case 
of the Luminex machine, “it made it through 
fine’, Altman says. 


BACTERIAL HIGH ALERT 

Scientists in the BSL-4 lab will also be using 
the BacT/ALERT 3D 60, an automated 
system made by bioMérieux, based in 
Marcy-l Etoile, France, that tests samples of 
body fluid for fungi, mycobacteria or bac- 
teria — such as Yersinia pestis, which causes 
bubonic plague, or the anthrax culprit Bacillus 
anthracis. 

To accomplish this task, the instrument has 
a built-in incubator that rocks sample bottles, 
swirling their contents to enhance microbial 
growth. At the bottom of each bottle is a dab 
of dried silicone that contains a dye and acts as 
a pH sensor. When the pH in a bottle changes 
because of microbial respiration, the sensor’s 
colour changes from blue-green to yellow. A 
detection system illuminates the bottles peri- 
odically and alerts lab staff to any change in 
sensor colour, says Doug Matthews, product 
manager at bioMérieux. 

“Every ten minutes, it takes a reading and 
will alert you if it detects growth,” Matthews 
says. Speed is important in hospitals, where 
this instrument is often deployed, as well as 
in clinical-testing labs and the pharmaceutical 
and food industries. But in a BSL-4 lab, rapid 
detection is even more crucial, he says. 

After detection through this automated 
incubator, scientists can then confirm their 
findings with other steps. A sample will be 
grown in an agar dish to identify specific 
pathogens, for example, or to test how well an 
antimicrobial agent might work. Research- 
ers might also extract genetic material for 
sequencing. 

The BacT/ALERT 3D 60 was not made 
specifically for BSL-4 labs, but a number of its 
design features help it to work in that environ- 
ment, says Matthews. Despite its compact 
size, it can handle 60 samples. Its touch screen 
can be operated by heavily gloved hands. The 
instrument is designed to be hardy — company 
records of installed instruments show that the 
machines do not need major service or repair 
for an average of 1,000 days, he says. The sample 
bottles are made from nylon sandwiched 


HIGH-SECURITY LABS (uae e)meley) 


In a custom computed-tomography scanner, animals can be imaged without contaminating equipment. 


between two layers of polycarbonate to resist 
shattering and spreading infectious cultures. 
“Keeping glass out of a BSL lab is obviously a 
key safety feature,” Matthews says. 

Later this year, bioMérieux intends to launch 
a new model of the instrument, called the 
Virtuo, in Europe. The Virtuo handles more 
bottles than the BacT/ALERT 3D 60 and auto- 
matically places them in racks after scanning 
their bar codes. The instrument is now in clin- 
ical trials, says Matthews, and the company is 
slated to file for approval with the US Food and 
Drug Administration in 2015. 


CELLULAR CONNECTION 
The IRF’s new BSL-4 lab is subdivided into 
rooms for different tasks, such as sample 
preparation, cell culture, animal care or imag- 
ing, and tests for clinical pathology, molecular 
biology and virology. But moving between 
those rooms is not simple, because scientists 
in protective gear are tethered by an air hose 
that stretches only so far. To venture farther, 
they have to detach from their air supply. “If 
you disconnect your air you could last for ten 
minutes,’ says Jahrling. “You just get used to 
disconnecting your air and walking ten feet 
down to the next air line and reconnecting.” 
Given the multitude of tasks researchers do in 
the IRF lab, they need to confer often. Years of 
working in ‘space suits’ have made Hensley and 
Jahrling fluent in non-verbal communication 


— bya wave ofa gloved hand, for example. For 
the new lab, they wanted to go one better — in 
part because they noted a cultural shift among 
younger scientists, who seemed to want to com- 
municate more when in the BSL-4 lab. 

After testing various communication sys- 
tems, they chose mobile phones and Bluetooth 
headsets. The phones are external to the suits; 
the headsets are internal. “You pick up the cell 
phone on the hot side of the suit? Hensley says. 

Communication and collaboration have 
been a goal for the lab since 2005, when con- 
struction of the IRF began as part of a plan to 
bolster US biodefence research. To enable col- 
laboration, the IRF is located next to the US 
Army Medical Research Institute of Infectious 
Diseases (USAMRIID) and labs run by the US 
Department of Homeland Security. Both Jah- 
rling and Hensley previously worked at the 
USAMRIID. 

Jahrling says that those neighbouring labs 
are the subject of much rumour and specula- 
tion, but that no secret research is done at the 
IRF. On the contrary, he has extended an open 
invitation to scientists from other biosafety 
research facilities to visit, so that they can see 
for themselves what it is like to battle the most 
noxious weapons from nature's arsenal in an 
up-and-running, next-generation BSL-4 lab. m 


Vivien Marx is technology editor for Nature 
and Nature Methods. 
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Rare insights 


Scientists who specialize in uncommon diseases can find 
aresearch focus with a purpose. 


BY HEIDI LEDFORD 


teven Gray used to spend long hours in 
the lab for the simple love of science. As 


a postdoctoral researcher, he was tinker- 
ing with a virus in search of ways to shuttle 
genes into nerve cells for gene therapy. Then, 
in 2008, his adviser sent him to a meeting held 
by a non-profit organization called Hannah's 
Hope Fund, and Gray found a new inspiration. 

Hannah's Hope Fund is a charity based in 
Rexford, New York, that supports research on 
giant axonal neuropathy (GAN), a fatal nerve 
disorder. At the meeting, Gray met Hannah 
Sames, a clumsy four-year-old with tight 
curls and a sweet smile whose disease had 
inspired her parents to start the charity. He 
launched a GAN project after the meeting. “I 
looked at her and saw my own daughter,’ says 
Gray, whose child was then also four. 
“Now I’m focused on finding a treatment, 


almost as I would for my own child” 

GAN is one of more than 6,000 rare, or 
‘orphan, diseases that affect humans world- 
wide. Such diseases typically strike fewer than 
1 in 2,000 people, and present unique chal- 
lenges to researchers and drug developers, who 
have access to limited numbers of participants 
for clinical trials and few resources such as ani- 
mal models. 

But for scientists who can overcome such 
challenges, the rewards can be tremendous. 
From a practical perspective, an increasing 
interest from industry and available government 
funds dedicated to rare diseases have brought 
new job options. “There are great opportunities 
for people in academia to interact with pharma 
and to access government funding,” says 
Daniel Ory, who studies Niemann-Pick type C 
disease, a genetic neurodegenerative disorder, 
at Washington University in St. Louis, Missouri. 

Rare-disease research also offers rich returns 


for scientists who thrive on interaction, adds 
Ory. Because of the small numbers affected and 
a dearth of information about most rare dis- 
eases, scientists must work closely with patients 
and their families. They also frequently collab- 
orate with patient-advocacy organizations to 
gather tissue samples, learn more about symp- 
toms and recruit subjects for clinical trials. 
Personal interaction presents challenges: many 
rare diseases are fatal, and they often affect 
children. “It really doesn't get any more raw 
or emotional,” says Nick Leschly, president of 
biotechnology firm bluebird bio in Cambridge, 
Massachusetts, which is seeking treatments for 
several rare diseases. 


HIRING OUTLOOK 

Thanks mostly to the fervour of patient advo- 
cates, research into rare diseases is booming. 
Philanthropic donations have allowed univer- 
sities to set up rare-disease centres, such as the 
University of Pennsylvania's Center for Orphan 
Disease Research and Therapy in Philadelphia, 
which has awarded more than US$4.1 million 
in grants to its researchers since it opened in 
2011. In 2012, the US National Institutes of 
Health (NIH) awarded $3.6 billion for rare-dis- 
ease research, including supporting dedicated 
initiatives such as its Therapeutics for Rare and 
Neglected Diseases programme. The European 
Commission's Seventh Framework Programme 
for research funding spent an estimated €530 
million (US$720 million) on orphan dis- 
eases between 2007 and 2013, according 
to EURORDIS, an alliance of rare-diseases 
patient organizations based in Paris. 

But it is in industry that the field has really 
taken off. Pharmaceutical companies that once 
shied away from developing drugs for small 
markets have learned from success stories 
such as Genzyme, an orphan-disease company 
based in Cambridge, Massachusetts. Genzyme 
built a booming business, compensating for the 
small market by charging high prices. (There 
are programmes to help patients to pay for their 
drugs, but the pricing remains controversial — 
see ‘Cost conundrum.) Other companies are 
now flocking to take advantage of regulatory 
incentives: in the United States, firms some- 
times receive tax credits for clinical trials of 
orphan-disease drugs, and US and European 
regulators often streamline the approval of such 
medicines. One-third of the 39 drugs approved 
by the US Food and Drug Administration 
(FDA) in 2012 were for orphan diseases, and 
the global market for them is expected to grow 
from $86 billion in 2012 to $112 billion in > 
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> 2017, according to BCC Research, a market- 
research firm in Wellesley, Massachusetts. 

As a result, orphan-drug development is 
teeming in small biotechnology firms such as 
bluebird bio, which almost doubled its staff by 
adding about 40 new employees this year, most 
of them scientists. Large pharmaceutical com- 
panies, including London-based GlaxoSmith- 
Kline (GSK), have developed specialized units 
that focus on rare diseases. Hans Schikan, chief 
executive of Prosensa, a rare-diseases company 
in Leiden, the Netherlands, was surprised when 
GSK approached him in 2009 about invest- 
ing in his company’s research, because he had 
not realized that GSK had any interest in rare 
diseases. The resulting partnership brought 
Prosensa $25 million, with added payments 
as the company hit milestones in its Duchenne 
muscular dystrophy projects. Since then, says 
Schikan, the field has blossomed. Other large 
pharmaceutical companies such as Novartis in 
Basel, Switzerland, have increased their focus 
on rare diseases, creating job opportunities. “It's 
stilla fairly young space,’ says Schikan. “But it's 
growing” 


MULTIPLE NICHES 
There are many different kinds of orphan dis- 
ease, and interested researchers can come from 
a variety of disciplines. Veterans in the field 
recommend that prospective researchers brush 
up on statistics because small sample sizes can 
require sophisticated analyses. Experience in 
the latest genome-sequencing techniques is a 
boon: the technology has unearthed the genetic 
basis for many rare diseases in recent years. 
“We're seeing many of the tools and technolo- 
gies developed for common diseases now being 
applied for the first time in rare diseases,” says 
Robert Steiner, executive director of the Marsh- 
field Clinic Research Foundation in Wisconsin. 
And orphan diseases provide an opportu- 
nity to carve out a niche and make contribu- 
tions with little fear of being scooped, Steiner 


adds. “There is often less competition,” he says. 
“And I’ve felt as if everything I was doing had 
the potential to add significant knowledge to 
an area where there wasa real gap.” 

But carving out a niche can be isolating, 
as Josh Sommer learned first-hand. In 2006, 
as an undergraduate at Duke University in 
Durham, North Carolina, Sommer was diag- 
nosed with chordoma, a rare spinal cancer. 
He later decided to work in a lab that studied 
the disease, and Duke, as it happened, was 
the country’s only university with a federally 
funded chordoma lab. But it did not take long 
for Sommer to learn about the frustrations of 
rare-disease research. “We didn’t have access 
to tissue samples, cell lines or mouse models,” 
he says. “And we didn't have other labs to reach 
out to or collaborate with. It was lonely:” 

Sommer left the university after his third year 
to co-found the Chordoma Foundation, also 
based in Durham, where he remains execu- 
tive director. One of his first initiatives was an 
annual workshop to bring together chordoma 
researchers and other scientists whose work 
may havea bearing on the disease. The founda- 
tion also sponsors $10,000 prizes for scientists 
who develop useful preclinical models. The cash 
is not much, says Sommer, but it has incentiv- 
ized several laboratory technicians to develop 
chordoma cell lines and mouse models. So far, 
the foundation has distributed three cell lines to 
60 labs and companies, and Sommer says that 
he stays in contact with more than 150 research- 
ers whose work may be relevant to the disease. 

Few preclinical models are available to rare- 
disease researchers, but this can be a chance to 
create high-quality models in one’s own lab, 
says Steiner. His biggest worry is making the 
case for funding. Although the NIH supports a 
range of orphan-disease research, Steiner says 
that he and his colleagues still occasionally 
receive grant reviews questioning the impor- 
tance of their work. “Some study sections for 
federal funding agencies are still focused on 


AT WHAT PRICE? 


Cost conundrum 


One way in which companies have made 
a profit from orphan drugs is by charging 
high prices —a trend that can frustrate 
researchers intent on helping patients. 
For example, the cystic fibrosis 
drug Kalydeco (ivacaftor), made by 
Vertex Pharmaceuticals of Cambridge, 
Massachusetts, can cost individuals a 
staggering US$373,000 per year. Drug 
companies say that they need to charge 
such high prices to recoup the cost of 
developing drugs for a tiny number of 
people. People with health insurance may 
not have to shoulder much of this cost, and 
many companies, including Vertex, have 


global financial-aid programmes to help 
those without insurance. 

But physicians and researchers are voicing 
concerns about whether these programmes 
are sufficient for ensuring access, and 
question how sustainable the high prices 
are. Charging so much for the drugs 
disregards the contribution of academic 
researchers and clinical-trial participants 
to drug development, argues Carlos Milla, a 
paediatrician who studies rare diseases at 
Stanford University in Palo Alto, California. 
“It’s more than just the company and the 
investors,” he says. “There was a whole 
community that contributed to this effort.” HL. 
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STEVEN GRAY 


Steven Gray (far right) was inspired to launch a project on giant axonal neuropathy, a rare disease. 


the idea that the significance of a project 
is directly proportional to the number of 
patients affected,” he says. 


SECURING FUNDS 
One method for boosting funding oppor- 
tunities is to look for ways in which a rare 
disease overlaps with a more common one. 
Heather Bean, a chemist at Dartmouth Col- 
lege in Hanover, New Hampshire, is using a 
two-year postdoctoral fellowship from the 
Cystic Fibrosis Foundation in Bethesda, 
Maryland, to support her studies of bacterial 
lung infections associated with cystic fibro- 
sis. But she hopes eventually to expand her 
funding opportunities by exploring overlaps 
between cystic fibrosis and a more common 
ailment, chronic obstructive pulmonary dis- 
ease. “I’m still committed to looking at cystic 
fibrosis,” she says. “But drawing those links 
to a bigger, more fundable disease is handy.” 
Some labs thrive by pulling in grants and 
fellowships from multiple foundations. 
Claudio Hetz, co-director of the Biomedical 
Neuroscience Institute at the University of 
Chile in Santiago, studies protein folding, 
which goes awry in several rare diseases 
including Creutzfeldt-Jakob disease, Hun- 
tington’s disease and amyotrophic lateral 
sclerosis. Hetz was worried about finding 
funds when he left his postdoc in the United 
States to open his own lab. He applied to a 
slew of foundations, hoping to get an award 
from one. He received three grants. “It was 
the starting point for everything,’ he says. “It 
allowed me to build a solid lab really fast” 
Hetz rattles offa list of five foundations that 
he works with, and says that he has forged 
personal relationships with people at each of 
them. Some, such as the Michael J. Fox Foun- 
dation for Parkinson's Research, based in New 
York, do more than just hand over money. 
Hetz says that he contacts a programme 
officer there when he encounters a technical 


stumbling block, and the officer works with 
him to find the right scientist to consult. 

Yet there can be drawbacks to foundation 
grants. The awards are often smaller than 
government grants — Gray, who now runs 
his own laboratory at the University of North 
Carolina in Chapel Hill, says that the largest 
of his seven foundation grants is still just shy 
of $250,000 a year. His smallest have values of 
about $50,000 per year. And the grants often 
last only a year or two, creating a sense of 
instability. Furthermore, foundations rarely 
pay for full operational costs — such as build- 
ing maintenance and administrative support 
— that universities typically take out of gov- 
ernment grants. For that reason, universities 
say that they can lose money on the awards, 
and sometimes force staff to decline them. 
Researchers who are interested in compet- 
ing for foundation money would therefore 
do well to check with their institutions to find 
out if they can accept the funds (see Nature 
504, 343; 2013). 

Foundations also expect their grant recipi- 
ents to remain focused on the goal of helping 
patients. Gray warns applicants to his lab that 
this will sometimes mean dropping scientifi- 
cally interesting experiments if they do not 
obviously contribute to the project's main 
mission. “We really make sure that every- 
thing we're doing is in the best interest of the 
people that are funding us,” he says. 

Gray is comfortable with that compromise. 
Last year, he applied to the FDA for approval 
to conduct a GAN clinical trial. He counts 
several people with GAN and their families 
among his friends. His voice is strained when 
discussing the recent death of an adult with 
the condition whom he met at that original 
meeting. “It’s tough,” he says. “You're always 
trying to work a little harder? = 


Heidi Ledford reports for Nature from 
Cambridge, Massachusetts. 
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GENDER 
Female speakers 


Having at least one woman on the speaker- 
recruiting team for a scientific conference 
boosts the number of female speakers, 
finds a 7 January study (A. Casadevall 

and J. Handelsman mBio http://doi. 
org/qsh; 2014). The authors examined 

460 symposia with a total of 1,845 speakers 
at two annual meetings sponsored by 
Washington DC’s American Society for 
Microbiology in 2011-2013. They focused 
on 104 all-male convener teams and 112 
with at least one woman. When at least one 
woman was on the team, the proportion 

of female speakers rose from an average of 
25% to 43%. Co-author Arturo Casadevall, 
a microbiologist at the Albert Einstein 
College of Medicine in New York, says that 
early-career female scientists can benefit 
from volunteering to be speaker recruiters. 


SALARIES 
Living in the present 


Prospective biomedical postgraduate 
students decide whether to enrol on 

the basis of current salaries rather than 
potential future earnings, says a study out 
on 23 December (M. E. Blume-Kohout 
and J. W. Clack PLoS ONE http://doi. 
org/qsq; 2013). Data for 1996-2010 
showed that postgraduate enrolment 
for a given year rose by 2.9-3.9% when 
relative wages for biomedical-science 
posts rose by 1%. But enrolment in 

a given year did not correspond to 
salary changes six years later, around 
graduation time. Prospective students 
should consider effects on salary trends, 
such as dips in agency budgets, says 
co-author Meg Blume-Kohout, a senior 
research economist at the New Mexico 
Consortium in Albuquerque. 


FEMALE RESEARCHERS 
Ireland lines up grants 


The Irish government plans to launch one- 
and two-year postdoctoral fellowships, 
worth up to €185,000 (US$252,000) each, 
to prompt early-career female researchers 
to stay in or return to the scientific 
workforce after childcare or other breaks. 
Science Foundation Ireland (SFI) in 
Dublin will announce the 20 or so Advance 
Fellowship grants by June, says Elena 
Martines, the SFT’s scientific programme 
officer. Currently, 35% of SFI-funded Irish 
postdocs and 20% of SFI-funded academic 
researchers are women, says Martines. 
“This is a very bold programme,’ she says, 
noting that few similar initiatives exist. 
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SECO 


BY KEN LIU 


he waitress brings me a glass of 
ik juice. She notices that I’m 

wearing gloves even though it’s the 
middle of summer, but she just shrugs and 
leaves. Too young to recognize me, I guess. 
Or maybe she’s still embarrassed by how I 
got kicked out of the game. 

I sip the juice through a straw. On the 
jumbo-sized TV hanging over the bar, 
they’re showing the Red Sox- Yankees game. 

A new player steps into the batter's box. 

“Look at that stance. It’s like watching 
a historical film,” the colour analyst says. 
“Gives you the shivers.” 

Herman Ruth settles in, his six-foot-two, 
180-pound frame filling the super-hi-def 
view. He's in prime shape. I’m sure the Red 
Sox training staff watch his diet like hawks. 

“What do you think about the court deci- 
sion?” asks the play-by-play announcer. 

“T think they got it right. I don’t see how they 
can stop him from playing. It’s not his fault 
that he’s cloned from Babe Ruth, you know? 
The kid just loves the game. Maybe he'll have 
a better record than even the original.” 

“And you think the small fine is fair?” 

“Yeah, I think so. Look, the Red Sox didn't 
force the kid. They cloned him and found a 
loving family to adopt him. They didnt tell 
him who he was and left him alone for 18 
years. Then they show up and offer him a job 
playing a game he loves. I just wish they hadn't 
disturbed the original Babe Ruth’s grave.” 

“What about the family’s objections?” 

“Like the court said, your family doesn't 
own you. You gotta give it to the Red Sox for 
coming up with this trick to make up for that 
debacle of a trade a century ago. A second 
chance for the Sox and the Babe!” 

Ruth swings and there's the solid crack of 
wood on leather. He stands there for a frac- 
tion of a second, watching the ball sail on 
its inevitable trajectory out of the ball park. 
Then his face breaks into that famous grin, 
and he begins the trot around the bases. 

I get up and pocket the straw. I take out 
some cash with my gloved hands and leave 
it on the table. 

I’ve been voted the AL MVP three times. 
All-Star more than a dozen times. I would 
have had the all-time home-run record. 

I could have been a 


D> NATURE.COM great, no, the greatest, 
Follow Futures: player in history. 
Y @NatureFutures I wort leave any of 
EA go.nature.com/mtoodm © my cells behind. 


A secure future. 


“Let me introduce you to Doctor Danzer,’ 
my agent Scott says. “She's as good a scien- 
tist as lam an agent.’ I haven't seen Scott in 
years. In fact, he’s been avoiding my calls ever 
since my disgrace. 

I shake the woman’s hand. “Pleased to 
meet you.” 

“I was a fan,’ she says. I note the past tense. 

“Of all my clients,” Scott says in a tone that 
suggests were still buddies, “you're the first 
to bring up the intellectual-property angle. 
It’s brilliant. Once we get the protection 
in place, we'll sell the cloning rights to the 
highest bidder.” 

I harrumph non-committally. He's prac- 
tically rubbing his hands because I finally 
found another way to make money for him. 
A second chance. 

“We can’t patent your genome, after the 
Supreme Court said naturally occurring 
genes can’t be patented. We can’t copyright 
it either, as copyright requires a threshold 
of ‘original expression’ that’s nonfunc- 
tional. But I found Doctor Danzer here to 
help us.” 

“Directed methylation,” she says, and 
looks at me expectantly. 

Scott and I wait politely. But she is appar- 
ently finished. 

“Doc, you gotta explain more,” Scott 
pleads. “We're not PhDs here” 

She sighs. “It’s really very simple. As a per- 
son develops from a fertilized egg, the DNA 
gets small bits of hydrocarbons attached to it 
ina process called methylation, part of gene- 
expression regulation. It’s one of the main 
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ND CHANCE 


ways that the cells in your tissues are differ- 
ent from stem cells.” 

“How does this help you provide that bit of 
‘originality’ to my genes?” I prod. 

“ve invented a technique that will add 
methyl groups to your DNA ina specific, 
targeted way. ll focus on the non-coding 
segments of your DNA to minimize the pos- 
sibility of side effects. To make the copyright 
office really happy, we can use binary code to 
etch some message into your genes based on 
the positions of the methyl groups.” 

“Tl write you a poem.’ 

“That will do. Then we can copyright the 
whole methylated genome. Since demethyla- 
tion of somatic cells during cloning tends to 
be incomplete, if they try to clone you, some 
of the artificial methylation will survive into 
the clone” 

“Thus infringing my copyright” ’m 
catching on. “But what if they’re willing to 
pay the damages?” 

“Ha!” Scott slaps my back. “This is where 
it becomes genius. By registering the copy- 
right, we can get statutory damages. How 
many cells are ina human body, doctor?” 

“About a hundred trillion.” 

“At the minimum statutory damage rate 
of $750 per copy, that’s... many times the 
national debt. Who can afford to clone you 
without authorization?” 

I can see Scott is already planning on 
spending his commission. 


“You sure about this?” I ask. 

“As sure as anything involving epigenet- 
ics,’ Doctor Danzer says. I asked her to stay 
after Scott left. “I can direct the methylation 
process in such a way that cloned embryos 
will not develop properly. You won't be able 
to sell your copyright at all” 

“That'll be perfect,” I said. “Thank you, 
doctor” 

“You could have been great,’ she says. 

Ishrug. 

I don’t just want to control my genes for 
the rest of my life plus 70 years. I want to 
make sure that there will never be another 
me in the world, no copies that may excel the 
original. Call me vain if you want. I may have 
made my mistakes, but I want to be the only 
me in the history of the Universe. 

No second chances. m 


Ken Liu is an author and translator of 
speculative fiction. For more about him and 
his work, visit http://kenliu.name or follow 
@kyliu99. 
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